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(57) Abstract 

The invention bascA in part on the di.^covery of a number of genes (HRPCa 2. HRPCa 3, HRPCa 6n. HRPCa 8, lIRPCa 9, 
HRPCa 10 HRPCa 13 HRPCa 14 HRPCi 15, and HRPCa 19) which are androgen-inducible in androgen-dependcnt prostate cancer (e.g., 
LNCaP cells) cells and constitutively expressed in androgen-independent prostate cancer cells (e.g., LN3 LNCaP cells). B«:au.se^drogcn 
is required for growth and survival of androgen-dependent prostate cancer cells, HRPCa 2. HRPCa 3. HRPCa 6/7. HRPCa 8. HRPCa 9. 
HRPCa 10 HRPCa 13 HRPCa 14. HRPCa IS. and HRPCa 19 are therapeutic targets. An agent which decreases die expression or acaviiy 
of HRPCa'2 HRPCa 3. HRPCa 6A7, HRPCa 8, HRPCa 9. HRPCa 10. HRPCa 13, HRPCa 14, HRPCa 15, or HRPCa 19 may slow or 
arrest the growth of prostate cancer cells or may kill prosutc cancer cells. Moreover, because die genes of the mvenuon are constitul.vely 
expressed by androgen-independent prostate cancer cclU. they can be used to idenUfy agents useful for the treatment of androgen-dependent 
and androgen-independent prostate cancer. , 
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EXPRESSION ANALYSIS OF SPECFFIC NUCLEIC ACIDS AND POLYPEPTIDES USEFUL IN THE DIAGNOSIS AND 
TREATMENT OF PROSTATE CANCER 



5 Related Application Information 

This application is a continuation-in-part of 
application serial number 09/164,155, filed September 30, 
1998 . 

Background of the Invention 

10 Prostate cancer is the most commonly diagnosed 

cancer and the second most common cause of death from 
cancer in American men. Prostate cancer cells often 
initially rely on androgen (e.g., testosterone) for their 
growth and maintenance.- Therefore, androgen withdrawal, 

15 by castration or through the use of an anti- androgenic 
drug, is a common treatment for prostate cancer. In many 
cases, however, prostate cancer patients develop 
androgen- independent prostate cancer so that androgen 
withdrawal treatment is no longer effective. 

20 The complex process of prostate tumor growth and 

development involves multiple gene products. Therefore, 
it is important to identify genes involved in tumor 
development, growth, and androgen dependence, 
particularly those genes and gene products that can serve 

2 5 as targets for the diagnosis, prevention, and treatment 
of prostate cancer. 

Summary of the Invention 
The invention is based in part on the discovery of 
a number of genes which are androgen- inducible in" 
30 androgen -dependent prostate cancer cells (e.g., LNCaP 
cells) and constitucively expressed in androgen- 
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independent prostate cancer cells (e.g., LN3 LNCaP 
cells) . These genes are referred to herein by their 
clone names: HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 
9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, and HRPCa 19. 
5 These genes are collectively referred to as "the genes of 
the inventions." The proteins encoded by the genes of 
the invention and fragments thereof are collectively 
referred to as the "proteins of the invention" or "the 
polypeptides of the invention." Nucleic acid molecules 
10 encoding a protein of the invention or a polypeptide of 
the invention are collectively referred to as "nucleic 
acids of the invention." 

HRPCa 2. HRPCa 3. HRPCa 6/7, HRPCa 8. HRPCa 9. HRPCa 10. 
HRPCa 13. HRPCa 14. HRPCa 15, and HRPCa 19 

15 The polypeptides, and nucleic acids of the 

invention are potential targets for the development of 
therapeutic compounds. Because androgen is required for 
growth and survival of androgen-dependent prostate cancer 
cells, genes such HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, 

20 HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, and 
HRPCa 19 whose expression is increased in the presence 
of testosterone and constitutively expressed in androgen- 
independent cancer cells are potential therapeutic 
targets. An agent which decreases the expression or 

25 activity of HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 
9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, or HRPCa 19 
may slow or arrest the growth of prostate cancer cells or 
may kill prostate cancer cells. Moreover, because the 
genes of the invention are constitutively expressed by 

30 androgen- independent prostate cancer cells, they can be 
used to identify agents useful for the treatment of 
androgen-dependent and androgen- independent prostate 
cancer. For example, an agent which reduces the 
expression or activity of a gene of the invention, e.g., 

35 HRPCa 9, may reduce the growth of androgen- independent 
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prostate cancer or cause an androgen- dependent prostate 
cancer to become androgen -dependent so that it can be 
treated with standard androgen withdrawal therapy. Of 
course, such an agent might also be useful for the 
5 treatment of an androgen- dependent prostate cancer. 

The genes, polypeptides, and nucleic acids of the 
invention also have diagnostic uses. For example, by 
examining the expression of one or more of the genes of 
the invention (e.g., measuring alterations in the level 

10 of a protein or nucleic acid of the invention) in a 
sample of prostate cancer cells, it may be possible to 
determine whether the cancer cells are androgen- dependent 
or androgen- independent . For example, if expression of 
HRPCa 9 in a sample of prostate cancer cells obtained 

15 from a patient is not androgen inducible, i.e., it is 
constitutively expressed, the prostate cancer is likely 
androgen- independent . This analysis allows one to 
predict whether androgen withdrawal therapy is likely to 
be beneficial to that patient. Thus, the analysis allows 

20 one to predict whether a selected compound, e.g., an 
anti -androgenic compound, can be used to treat the 
prostate cancer. Importantly, this determination can be 
made on a patient by patient basis. Thus, one can 
determine whether or not a particular prostate cancer 

25 treatment is likely to benefit a particular patient. 

The invention also features diagnostic methods and 
prognostic methods which can be used to identify patients 
having or at risk for developing and androgen- independent 
prostate cancer. The genes, polypeptides, and nucleic 

30 acids of the invention can be used to identify cells 
exhibiting or predisposed to developing prostate cancer 
(e.g., androgen- independent prostate cancer) thereby 
diagnosing individuals having, or at high risk for 
developing, prostate cancer (e.g., androgen- independent 

35 prostate cancer) . 
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In the various methods of the invention, gene 
expression can be measured at the mRNA or protein level. 
Alternatively, expression can be measured indirectly by 
measuring the activity of the protein encoded by the 
5 identified gene. 

In another aspect, the present invention provides 
a method for detecting the presence of activity or 
expression of HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 

9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, or HRPCa 19 in 
10 a biological sample by contacting the biological sample 

v/ith an agent capable of detecting an indicator of HRPCa 
2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 

13, HRPCa 14, HRPCa 15, or HRPCa 19 activity such that 
the presence of HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, 

15 HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, or HRPCa 
19 activity is detected in the biological sample. 

In another aspect, the invention provides a method 
for treating prostate cancer by modulating the expression 
or activity of HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, 

20 HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, or HRPCa 
19, the method comprising contacting a cell with an agent 
that modulates (inhibits or stimulates) HRPCa 2, HPRCa 3, 
HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 

14, HRPCa 15, or HRPCa 19 activity or expression such 

25 that HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 

10, HRPCa 13, HRPCa 14, HRPCa 15, or HRPCa 19 activity or 
expression in the cell is modulated. . In one embodiment, 
the agent is an antibody that specifically binds to a 
polypeptide or protein of the invention. 

30 In another embodiment, the agent modulates 

expression of HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 
9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, or HRPCa 19 by 
modulating transcription of a gene of the invention, 
modulating splicing of an mRNA produced by a gene of the 

35 invention, or modulating translation of an mRNA produced 
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by a gene of the invention. In yet another embodiment, 
the agent is a nucleic acid molecule having a nucleotide 
sequence that is antisense to the coding strand of HRPCa 
2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 
5 13, HRPCa 14, HRPCa 15, or HRPCa 19. 

In one embodiment, the methods of the present 
invention are used to treat a subject having a prostate 
cancer characterized by aberrant HRPCa 2, HPRCa 3, HRPCa 
6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, 

10 HRPCa 15, or HRPCa 19 protein activity or nucleic acid 
expression by administering an agent which is a HRPCa 2, 
HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, 
HRPCa 14, HRPCa 15, or HRPCa 19 modulator to the subject. 
The modulator can be a peptide, peptidomimetic, or small 

15 molecule, e.g., an organic molecule. 

The present invention also provides a diagnostic 
assay for identifying whether a patient has or is at risk 
of developing prostate cancer, e.g., an androgen- 
independent prostate cancer, by detecting the presence or 

20 absence of a genetic lesion or mutation characterized by 
at least one of: (i) aberrant modification or mutation 
of a gene of the invention; (ii) mis-regulation of a gene 
of the invention; and (iii) aberrant post- translational 
modification of a protein encoded by a gene of the 

25, invention. 

In another aspect, the invention provides a 
method for identifying a compound for the treatment of 
prostate cancer, e.g., an androgen- independent prostate 
cancer by indent if ying a compound that binds to or 

30 modulates the activity of a protein of the invention. In 
general , such methods entail measuring a biological 
activity of a protein of the invention in the presence 
and absence of a test compound and identifying those 
compounds which alter the activity of a protein of the 

35 invention. 
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The invention also features methods for 
identifying a compound which modulates the expression of 
a gene of the invention (at the mRNA or protein level) by 
measuring the expression of a nucleic acid or protein of 
5 the invention in the presence and absence of a compound. 
HRPCa 13 and Peripheral -Type Benzodiazepine Receptor 

HRPCa 13 encodes endozepine, a protein that is 
involved in steroid synthesis (Hall et al. (1995) 
Molecular Neurobiol. 10:1-17). Endozepine binds to the 

10 peripheral -type benzodiazepine receptor (PBR) , a receptor 
that was first identified as a binding site for 
benzodiazepine diazepam (valium) . PBR is a 60 amino acid 
protein (GenBank Accession No. U12421) . PBR is expressed 
by nearly all tissues, and is expressed at the highest 

15 levels in steroid-producing tissues. In these tissues, 
PBR is known to regulate the transport of cholesterol 
from the outer membrane of mitochondria to the inner 
membrane. This transport step is likely the rate 
limiting step in steroid biosynthesis. PBR has been 

20 cloned (Riond et al. (1991) Eur. J". Biochem. 11:471-480). 

Given that HRPCa 13 (endozepine) is a target for 
the development of therapeutic compounds useful for, 
e.g., the treatment of cancer, particularly prostate 
cancer, PBR is also a target for the development of such 

25 therapeutic compounds. Thus, the invention features 
methods for identifying compounds which bind to PBR, 
reduce the activity of PBR, or reduce the expression of 
PBR. 

The invention features methods for identifying 
30 therapeutic compounds by measuring the binding of a test 
compound to PBR. Compounds which bind to PBR may, like 
compounds which reduce HRPCa 13 expression or activity 
may be useful for the treatment of androgen- independent 
prostate cancer or androgen- dependent prostate cancer. 
35 The binding of a test compound to PBR can be measured in 
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the presence or absence of endozepine. Particularly 
useful compounds are those which successfully compete 
with, e.g., have a higher affinity for PBR, endozopine. 
The invention also features a method for 
5 identifying therapeutic compounds by identifying a 
compound which alters, e.g., decreases, the activity, 
e.g., an endozopine -induced activity, of PBR. 

There are a number of assays which have been used 
to measure the binding of various ligands (e.g., PK11195, 

10 Ro5-4864, diazepam, (-) PK 14067, and (+) PK 14068) to 
PBR (Chang et al . (1992) DNA Cell Biol.; Broaddus et al . 
(1990) Brain Res. 518:199-208; Alexander et al . (1992) 
Biochem. Pharmacol. 44:269-274; and Olson et al., (1998) 
Cancer Res. 48:5837-41). These assays can be adapted to 

15 identify compounds which reduce the binding of endozapine 
to PBR or to identify compounds that bind to PBR. 

In addition, the invention features a method for 
identifying a therapeutic compound, e.g., a compound 
useful for the treatment of androgen- dependent or 

20 androgen- independent cancer, e.g., prostate cancer, by 
measuring the expression of PBR in the presence and 
absence of a test compound. Compounds, that decrease the 
expression of PBR, like compounds that decrease the 
expression of HPRCa 13 , may be useful for the treatment 

25 of prostate cancer. 

PBR may also have diagnostic applications. 
Increased expression of PBR in prostate cancer cells 
might be associated with androgen- independent prostate 
cancer. Thus, by examining the expression of PBR (e.g., 

30 by measuring PBR protein or nucleic acid) in sample 
comprising prostate cells, it may be possible to 
determine whether the prostate cancer cells are androgen- 
dependent or androgen independent. In addition, the 
invention features diagnostic and prognastic methods 

35 which can be used to identify patients having or at risk 
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for developing androgen -independent prostate cancer by 
measuring the expression or activity of PBR. 

The invention also features methods of treating 
prostate cancer by modulating the expression or activity 
5 of PBR, the method comprising contacting a cell with an 
agent that modulates (inhibits) PBR expression or 
activity such that PBR expression or activity in the cell 
is modulated {inhibited or reduced) . 
Definitions 

10 Differential expression refers to both 

quantitative, as well as qualitative, differences in the 
expression pattern of a gene in tumor cells treated with 
a particular drug and untreated tumor cells. A 
differentially expressed gene can represent fingerprint 

15 gene and/or a target gene. 

A fingerprint gene is a differentially expressed 
gene whose expression pattern can be utilized as part of 
a prognostic or diagnostic marker for the evaluation of 
prostate cancer (e.g., androgen independent prostate 

20 cancer) or which, alternatively, can be used in methods 
for identifying compounds useful for the treatment of 
prostate cancer or evaluating the effectiveness of a 
prostate cancer treatment. For example, the effect of 
the compound on the fingerprint gene expression pattern 

25 normally displayed in connection with prostate cancer can 
be used to evaluate the efficacy of the compound as a 
treatment for prostate cancer or can, additionally, be 
used to monitor patients undergoing clinical evaluation 
for the treatment of prostate cancer. 

30 A fingerprint pattern is the pattern generated 

v/hen the expression pattern of a series (which can range 
from two up to all the fingerprint genes which exist for 
a given state) of fingerprint genes is determined. A 
fingerprint pattern can be used in the same diagnostic, 
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prognostic and compound identification methods as the 
expression of a single fingerprint gene. 

A target gene is a differentially expressed gene 
involved in prostate cancer such that modulation of the 
5 level of target gene expression or of target gene product 
activity can act to prevent and/or ameliorate symptoms or 
androgen- independence of the prostate cancer. Compounds 
that modulate the expression of the target gene or the 
activity of the target gene product can be used in the 

10 treatment of prostate cancer. Still further, compounds 
that modulate the expression of the target gene or 
activity of the target gene product can be used in 
treatments to deter benign cells from developing into 
prostate cancer cells. Still further, compounds that 

15 modulate the expression of the target gene or activity of 
the target gene product can be used to design a 
preventive intervention in pre-neoplastic cells in 
individuals at high risk. 

An androgen- dependent prostate cancer cell is a 

20 cell that requires androgen for continued cell division. 
Conversely, and androgen- independent prostate cancer cell 
is a cell that can continue to divide in the absence of 
androgen . 

Other features and advantages of the invention 
25 will be apparent from the following detailed description 
and claims. 



Brief Description of the Drawings 

Figure 1 depicts the cDNA sequence (SEQ ID NO: ) 

and predicted amino acid sequence (SEQ ID NO: ) of human 

30 HRPCa 2. 

Figure 2 depicts the cDNA sequence (SEQ ID NO: ) 

and predicted amino acid sequence (SEQ ID NO: ) of human 

HRPCa 3. 
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Figure 3 depicts the cDNA sequence (SEQ ID NO: ) 

and predicted amino acid sequence (SEQ ID NO: ) of human 

HRPCa 6/7. 

Figure 4 depicts the cDNA sequence (SEQ ID NO: ) 

5 and predicted amino acid sequence (SEQ ID NO: ) of human 

HRPCa 9. 

Figure 5 depicts the cDNA sequence (SEQ ID NO: ) 

and predicted amino acid sequence (SEQ ID NO: ) of human 

HRPCa 10. 

10 Figure 6 depicts the cDNA sequence (SEQ ID NO: ) 

and predicted amino acid sequence (SEQ ID NO: ) of human 

HRPCa 13. 

Figure 7 depicts the cDNA sequence (SEQ ID NO: ) 

and predicted amino acid sequence (SEQ ID NO: ) of human 

15 HRPCa 14 . 

Figure 8 depicts the cDNA sequence (SEQ ID NO: ) 

and predicted amino acid sequence (SEQ ID NO: ) of human 

HRPCa 15. 

Figure 9 depicts the cDNA sequence (SEQ ID NO: ) 

20 and predicted amino acid sequence (SEQ ID NO: ) of human 

HRPCa 19. 

Detailed Description of the Invention 
The present invention is based on the discovery of 
ten genes, HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, 

25 HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, and HRPCa 19, 
whose expression is testosterone inducible in androgen- 
dependent prostate cancer cells (LNCaP cells) and whose 
expression is constitutive, i.e., not testosterone 
inducible, in androgen- independent prostate cancer cells 

30 (LN3 LNCaP cells) . Of these, HRPCa 9 and HRPCa 10 
represent novel genes which are described in greater 
detail in U.S. Serial No. 09/163,759, filed September 30, 
1998, the contents of which is hereby incorporated by 
reference . 
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Table 1 provides a summary of certain information 
regarding HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, 
HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, and HRPCa 19 
including the name of the known protein or gene and 
5 Accession Numbers which can be used to obtain the 
sequence of the known genes. 

Table 1: HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8. HRPCa 

9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, 
and HRPCa 19. 



15 



Clone 


Protein Name 


Accession No. 


Figure 


HrPCa 2 


Stathmir./Oncoprotein 18 


X53305 


1 


HRPCa 3 


Tumor ar.cigen L6 


M90657 


2 


HRPCa 6/7 


TMPRSS2 


U75329 


3 


HRPCa 8 


Basic Transcription 
Factor 2 


Z30094 




HRPCa 9 


Novel 




4 


HRPCa 10 


Novel 




5 


HRPCa 13 


Endozepine 


M42085 


6 


HRPCa 14 


KIAA0095 


D13748 


7 


HRPCa 15 


eIF4AI 


D13748 


8 


HRPCa 19 


1 5 - hydroxypros t ogl adin 
dehydrogenas e 


J05594 


9 



I. Isolated Nucleic Acid Molecules 

HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, 
HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, HRPCa 19, and PER 
nucleic acid molecules are useful in the various methods 
25 of the invention. Described below are various methods 
for the production and use of the nucleic acid molecules 
of the invention. 
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As used herein, the term "nucleic acid molecule" 
is intended to include DNA molecules (e.g., cDNA or 
genomic DNA) and RNA molecules (e.g., mRNA) and analogs 
of the DNA or RNA generated using nucleotide analogs. 
5 The nucleic acid molecule can be single-stranded or 
double- stranded, but preferably is double-stranded DNA. 

An "isolated" nucleic acid molecule is one which 
is separated from other nucleic acid molecules which are 
present in the natural source of the nucleic acid. 

10 Preferably, an "isolated" nucleic acid is free of 

sequences (preferably protein encoding sequences) which 
naturally flank the nucleic acid (i.e., sequences located 
at the 5' and 3' ends of the nucleic acid) in the genomic 
DNA of the organism from which the nucleic acid is 

15 derived. For example, in various embodiments, the 

isolated HRPCa 9 nucleic acid molecule can contain less 
than about 5kb, 4kb, 3kb, 2kb, Ikb, O.SkborO.lkb 
of nucleotide sequences which naturally flank the nucleic 
acid molecule in genomic DNA of the cell from which the 

20 nucleic acid is derived. Moreover, an "isolated" nucleic 
acid molecule, such as a cDNA molecule, can be 
substantially free of other cellular material, or culture 
medium when produced by recombinant techniques, or 
substantially free of chemical precursors or other 

25 chemicals when chemically synthesized. 

A nucleic acid molecule of the present invention 
can be isolated using standard molecular biology 
techniques and the sequence information provided herein. 
Using all or a portion of the nucleic acid sequences of a 

30 gene of the invention as a hybridization probe, HRPCa- 9 
nucleic acid molecules can be isolated using standard 
hybridization and cloning techniques (e.g., as described 
in Sambrook et al . , eds.. Molecular Cloning: A 
Laboratory Manual. 2nd, ed. , Cold Spring Harbor 
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Laboratory, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, NY, 1989) . 

A nucleic acid of the invention can be amplified 
using cDNA, mRNA or genomic DNA as a template and 
5 appropriate oligonucleotide primers according to standard 
PCR amplification techniques. The nucleic acid so 
amplified can be cloned into an appropriate vector and 
characterized by DNA sequence analysis. Furthermore, 
oligonucleotides corresponding to a nucleic acid of the 

10 invention can be prepared by standard synthetic 

techniques, e.g., using an automated DNA synthesizer. 

The nucleic acid molecules of the invention can 
comprise only a portion of a nucleic acid sequence of a 
gene of the invention, for example, a fragment which can 

15 be used as a probe or primer or a fragment encoding a 
biologically active portion "of a protein of the 
invention. The probe/primer typically comprises 
substantially purified oligonucleotide. The 
oligonucleotide typically comprises a region of 

20 nucleotide sequence that hybridizes under stringent 
conditions to at least about 12, preferably about 25, 
more preferably about 50, 75, 100, 125, 150, 175, 200, 
250, 300, 350 or 400 consecutive nucleotides of the sense 
or anti-sense sequence a nucleic acid of the invention. 

25 Such probes and primers are useful for measuring 

the expression of a gene of the invention. For example, 
probes based on the human HRPCa 9 nucleotide sequence can 
be used to detect transcripts or genomic sequences 
encoding the same or identical proteins. The probe 

30 comprises a label group attached thereto, e.g., a 

radioisotope, a fluorescent compound, an enzyme, or an 
enzyme co- factor. Such probes can be used as a part of a 
diagnostic test kit for identifying cells or tissues 
which mis-express a HRPCa 9 protein, such as by measuring 

35 a level of a HRPCa 9-encoding nucleic acid in a sample of 
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cells from a subject, e.g., detecting HRPCa-9 mRNA levels 
or determining whether a genomic HRPCa 9 gene has been 
mutated or deleted. 

In addition to the nucleotide sequences disclosed 
5 herein, it will be appreciated by those skilled in the 
art that DNA sequence polymorphisms that lead to changes 
in the amino acid sequence encoded by a gene of the 
invention may exist within a population (e.g., the human 
population) . Such genetic polymorphisms may exist among 

10 individuals within a population due to natural allelic 
variation. An allele is one of a group of genes which 
occur alternatively at a given genetic locus. Such 
natural allelic variations can typically result in 1-5% 
variance in the nucleotide sequence of the HRPCa 9 gene. 

15 Alternative alleles can be identified by sequencing the 
gene of interest in a number of different individuals. 
This can be readily carried out by using hybridization 
probes to identify the same genetic locus in a variety of 
individuals. The presence of a given allele may be 

20 diagnostic for prostate cancer. 

Nucleic acid molecules which hybridize xmder 
stringent conditions to a gene of the invention are 
useful diagnostically and therapeutically. As used 
herein, the term "hybridizes under stringent conditions" 

25 is intended to describe conditions for hybridization and 
washing under which nucleotide sequences at least 60% 
(65%, 70%, preferably 75%) identical to each other 
typically remain hybridized to each other. Such 
stringent conditions are )cnown to those skilled in the 

30 art and can be found in Current Protocols in Molecular 
Biolo^, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. A 
preferred, non-limiting example of stringent 
hybridization conditions are hybridization in 6X sodium 
chloride/sodium citrate (SSC) at about 45°C, followed by 

35 one or more washes in 0.2 X SSC, 0.1% SDS at 50-65°C. 
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Preferably, an isolated nucleic acid molecule of the 
invention that hybridizes under stringent conditions to 
the sequence of a gene of the invention corresponds to a 
naturally-occurring nucleic acid molecule. As used 
5 herein, a "naturally-occurring" nucleic acid molecule 
refers to an RNA or DNA molecule having a nucleotide 
sequence that occurs in nature (e.g., encodes a natural 
protein) . 

An isolated nucleic acid molecule encoding a 
10 protein having a sequence which differs from that encoded 
by a gene of the invention can be created by introducing 
one or more nucleotide substitutions, additions or 
deletions into the nucleotide sequences disclosed herein 
such that one or more amino acid substitutions, additions 
15 or deletions are introduced into the encoded protein. 

Mutations can be introduced by standard techniques, such 
as site-directed mutagenesis and PCR-mediated 
mutagenesis. Preferably, conservative amino acid 
substitutions are made at one or more predicted non- 
20 essential amino acid residues. A "conservative amino 

acid substitution" is one in which the amino acid residue 
is replaced with an amino acid residue having a similar 
side chain. Families of amino acid residues having 
similar side chains have been defined in the art. These 
25 families include amino acids with basic side chains 

(e.g., lysine, arginine, histidine) , acidic side chains 
(e.g., aspartic acid, glutamic acid), uncharged polar 
side chains (e.g., glycine, asparagine, glutamine, 
serine, threonine, tyrosine, cysteine) , nonpolar side 
30 chains (e.g., alanine, valine, leucine, isoleucine, 
proline, phenylalanine, methionine, tryptophan), beta- 
branched side chains (e.g., threonine, valine, 
isoleucine) and aromatic side chains (e.g., tyrosine, 
phenylalanine, tryptophan, histidine) . Thus, a predicted 
35 nonessential amino acid residue is preferably replaced 
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with another amino acid residue from the same side chain 
family. Alternatively, mutations can be introduced 
randomly along all or part of a coding sequence, such as 
by saturation mutagenesis, and the resultant mutants can 
5 be screened for biological activity to identify mutants 
that retain activity. Following mutagenesis, the encoded 
protein can be expressed recombinantly and the activity 
of the protein can be determined. 

Antisense nucleic acid molecules, i.e., molecules 

10 which are complementary to a sense nucleic acid, encoding 
a protein, e.g., complementary to the coding strand of a 
double-stranded cDNA molecule or complementary to an mRNA 
sequence, are useful in the methods of the invention. 
The antisense nucleic acid can be complementary to an 

15 entire coding strand of a gene of the invention, or to 
only a portion thereof, e.g., all or part of the protein 
coding region (or open reading frame) . An antisense 
nucleic acid molecule can be antisense to a noncoding 
region of the coding strand. The noncoding regions ("5' 

20 and 3' untranslated regions") are the 5' and 3' sequences 
which flank the coding region and are not translated into 
amino acids . 

Given the coding strand sequences disclosed 
herein, antisense nucleic acids of the invention can be 

25 designed according to the rules of Watson and Crick base 
pairing. For example, the antisense oligonucleotide can 
be complementary to the region surrounding the 
translation start site of an mRNA produced by a gene of 
the invention. An antisense oligonucleotide can be, for 

30 example, about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 
nucleotides in length and can be constructed using 
chemical synthesis and enzymatic ligation reactions using 
procedures known in the art. For example, an antisense 
nucleic acid (e.g., an antisense oligonucleotide) can be 

3 5 chemically synthesized using naturally occurring 
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nucleotides or variously modified nucleotides designed to 
increase the biological stability of the molecules or to 
increase the physical stability of the duplex formed 
between the antisense and sense nucleic acids, e.g., 
5 phosphorothioate derivatives and acridine substituted 
nucleotides can be used. Examples of modified 
nucleotides which can be used to generate the antisense 
nucleic acid include 5-f luorouracil , 5-bromouracil , 5- 
chlorouracil , 5-iodouracil , hypoxanthine , xanthine, 4- 

10 acetylcytosine, 5- (carboxyhydroxylmethyl) uracil, 5- 
carboxymethylaminomethyl -2 - thiouridine , 5- 
carboxymethylaminomethyluracil, dihydrouracil, beta-D- 
galactosylqueosine, inosine, N6-isopentenyladenine, 1- 
methylguanine, 1-methylinosine, 2 , 2 -dimethylguanine, 2- 

15 methyladenine, 2 -methyl guanine, 3-methylcytosine, 5- 
methylcytosine, N6-adenine, 7 -methylguanine , 5- 
methylaminomethyluracil , 5 -methoxyaminomethyl -2 - 
thiouracil, beta-D-mannosylqueosine , 5'- 
methoxycarboxymethyluracil , 5-methoxyuracil , 2- 

20 methylthio-N6-isopentenyladenine, uracil- 5 -oxyacetic acid 
(v) , wybutoxosine, pseudouracil , queosine, 2- 
thiocytosine, 5-methyl-2-thiouracil , 2 -thiouracil , 4- 
thiouracil, 5-methyluracil , uracil -5-oxyacetic acid 
methylester, uracil -5-oxyacetic acid (v) , 5-methyl-2- 

25 thiouracil, 3- (3-amino-3-N-2-carboxypropyl) uracil, 
{acp3)w, and 2, 6-diaminopurine. Alternatively, the 
antisense nucleic acid can be produced biologically using 
an expression vector into which a nucleic acid has been- 
subcloned in an antisense orientation (i.e., RNA 

30 transcribed from the inserted nucleic acid will be of an 
antisense orientation to a target nucleic acid of 
interest, described further in the following subsection) . 

In a therapeutic context, an antisense nucleic 
acid molecule is typically administered to a subject or 
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generated in situ such that they hybridize with or bind 
to cellular mRNA and/or genomic DNA encoding a protein of 
the invention to thereby inhibit expression of the 
protein, e.g., by inhibiting transcription and/or 
5 translation. The hybridization can be by conventional 
nucleotide complementarity to form a stable duplex, or, 
for example, in the case of an anti sense nucleic acid 
molecule which binds to DNA duplexes, through specific 
interactions in the major groove of the double helix. An 

10 example of a route of administration of antisense nucleic 
acid molecules includes direct injection at a tissue 
site. Alternatively, antisense nucleic acid molecules 
can be modified to target selected cells and then 
administered systemically . For example, for systemic 

15 administration, antisense molecules can be modified such 
that they specifically bind to receptors or antigens 
expressed on a selected cell surface, e.g., by linking 
the antisense nucleic acid molecules to peptides or 
antibodies which bind to cell surface receptors or 

20 antigens. The antisense nucleic acid molecules can also 
be delivered to cells using the vectors described herein. 
To achieve sufficient intracellular concentrations of the 
antisense molecules, vector constructs in which the 
antisense nucleic acid molecule is placed under the 

25 control of a strong pol II or pol III promoter are 
preferred. 

An antisense nucleic acid molecule of the 
invention can be an of-anomeric nucleic acid molecule. An 
Qf-anomeric nucleic acid molecule forms specific double- 

30 stranded hybrids with complementary RNA in which, 

contrary to the usual /?-units, the strands run parallel 
to each other (Gaultier et al . (1987) Nucleic Acids. Res. 
15:6625-6641) . The antisense nucleic acid molecule can 
also comprise a 2 ' -o-methylribonucleotide (Inoue et al . 

35 (1987) Nucleic Acids Res. 15:6131-6148) or a chimeric 
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RNA-DNA analogue (Inoue et al. (1987) FEES Lett. 215:327- 
330) . 

Ribozymes are also useful in the methods of the 
invention, e.g., to decrease expression of a protein of 
5 the invention. Ribozymes are catalytic RNA molecules 
with ribonuclease activity which are capable of cleaving 
a single-stranded nucleic acid, such as an mRNA, to which 
they have a complementary region. Thus, ribozymes (e.g., 
hammerhead ribozymes (described in Haselhoff and Gerlach 

10 (1988) Nature 334:585-591)) can be used to catalytically 
cleave an mRNA transcript produced by a gene of the 
invention to thereby inhibit translation of the mRNA. A 
ribozyme having specificity for a given nucleic acid can 
be designed based upon the nucleotide sequences disclosed 

15 herein. For example, a derivative of a Tetrahymena L-19 
IVS RNA can be constructed in which the nucleotide 
sequence of the active site is complementary to the 
nucleotide sequence to be cleaved in a HRPCa 9-encoding 
mRNA. See, e.g., Cech et al . U.S. Patent No. 4,987,071; 

20 and Cech et al. U.S. Patent No. 5,116,742. 

Alternatively, HRPCa 9 mRNA can be used to select a 
catalytic RNA having a specific ribonuclease activity 
from a pool of RNA molecules. See, e.g., Bartel and 
Szostak (1993) Science 261:1411-1418. 

25 Also useful in the methods of the invention are 

nucleic acid molecules which form triple helical 
structures. For example, HRPCa 9 gene expression can be 
inhibited by targeting nucleotide sequences complementary 
to the regulatory region of the HRPCa-9 (e.g., the HRPCa 

30 9 promoter and/or enhancers) to form triple helical 

structures that prevent transcription of the HRPCa 9 gene 
in target cells. See generally, Helene (1991) Anticancer 
Drug Des. 6 (6) : 569-84 ; Helene (1992) Ann. N.Y. Acad. Sci . 
660:27-36; andMaher (1992) Bioassays 14 (12) : 807-15 . 



wo 00/18961 



PCT/US99/22535 



- 20 - 

Nucleic acid molecules that are modified at the 
base moiety, sugar moiety or phosphate backbone to 
improve, e.g., the stability, hybridization, or 
solubility of the molecule are useful in the methods of 
5 the invention. For example, the deoxyribose phosphate 
backbone of the nucleic acids can be modified to generate 
peptide nucleic acids (see Hyrup et al . (1996) Bioorganic 
& Medicinal Chemistry 4(1): 5-23). As used herein, the 
terms "peptide nucleic acids" or "PNAs" refer to nucleic 

10 acid mimics, e.g., DNA mimics, in which the deoxyribose 
phosphate backbone is replaced by a pseudopeptide 
backbone and only the four natural nucleobases are 
retained. The neutral backbone of PNAs has been shown to 
allow for specific hybridization to DNA and RNA under 

15 conditions of low ionic strength. The synthesis of PNA 
oligomers can be performed using standard solid phase 
peptide synthesis protocols as described in Hyrup et al . 
(1996) supra; Perry-0' Keef e et al. (1996) Proc. Natl. 
Acad. Sci. USA 93: 14570-675. 

20 PNAs can be used in therapeutic and diagnostic 

applications. For example, PNAs can be used as antisense 
or antigene agents for sequence -specific modulation of 
gene expression by, e.g., inducing transcription or 
• translation arrest or inhibiting replication. PNAs can 

25 also be used, e.g., in the analysis of single base pair 
mutations in a gene by, e.g., PNA directed PGR clamping; 
as artificial restriction enzymes when used in 
combination with other enzymes, e.g., SI nucleases (Hyrup 
(1996) supra; or as probes or primers for DNA sequence 

30 and hybridization (Hyrup (1996) supra; Perry-O' Keefe et 
al. (1996) Proc. Natl. Acad. Sci. USA 93: 14670-675). 

PNAs can be modified, e.g., to enhance their 
stability or cellular uptake, by attaching lipophilic or 
other helper groups to PNA, by the formation of PNA-DNA 

35 chimeras, or by the use of liposomes or other techniques 
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of drug delivery known in the art. For example, PNA-DNA 
chimeras of HRPCa 9 can be generated which may combine 
the advantageous properties of PNA and DNA. Such 
chimeras allow DNA recognition enzymes, e.g., RNAse H and 
5 DNA polymerases, to interact with the DNA portion while 
the PNA portion would provide high binding affinity and 
specificity. PNA-DNA chimeras can be linked using 
linkers of appropriate lengths selected in terms of base 
stacking, number of bonds between the nucleobases, and 

10 orientation (Hyrup (1996) supra) . The synthesis of PNA- 
DNA chimeras can be performed as described in Hyrup 
(1996) supra and Finn et al . (1996) Nucleic Acids 
Research 24 (17) :3357-63 . For example, a DNA chain can be 
synthesized on a solid support using standard 

15 phosphoramidite coupling chemistry and modified 
nucleoside analogs. Compounds such as 5'-(4- 
methoxytrityl) amino-5 ' -deoxy- thymidine phosphoramidite 
can be used as a link between the PNA and the 5' end of 
DNA (Mag et al . (1989) Nucleic Acid Res. 17:5973-88). 

20 PNA monomers are then coupled in a stepwise manner to 
produce a chimeric molecule with a 5' PNA segment and a 
3' DNA segment (Finn et al . (1996) Nucleic Acids Research 
24 (17) : 3357-63) . Alternatively, chimeric molecules can 
be synthesized with a 5' DNA segment and a 3' PNA segment 

25 (Peterser et al . (1975) Bioorganic Med. Chem. Lett. 
5 :1119-11124) . 

The oligonucleotide may include other appended 
groups such as peptides (e.g., for targeting host cell 
receptors in vivo) , or agents facilitating transport 

30 across the cell membrane (see, e.g., Letsinger et al . 

(1989) Proc. Natl. Acad. Sci. USA 86:6553-6556; Lemaitre 
et al. (1987) Proc. Natl. Acad. Sci. USA 84:648-652; PCX 
Publication No. WO 88/09810) or the blood-brain barrier 
(see, e.g., PCX Publication No. WO 89/10134). In 

35 addition, oligonucleotides can be modified with 
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hybridization-triggered cleavage agents (see, e.g., Krol 
et al. (1988) Bio/Techniques 6:958-976) or intercalating 
agents {see, e.g., Zon (1988) Pharm. Res. 5:539-549). To 
this end, the oligonucleotide may be conjugated to 
5 another molecule, e.g., a peptide, hybridization 
triggered cross-linking agent, transport agent, 
hybridization- triggered cleavage agent, etc. 

II. Isolated Proteins and Antibodies 

Isolated proteins or the invention, and 

10 biologically active portions thereof, as well as 

polypeptide fragments suitable for use as immunogens to 
raise antibodies directed against a protein of the 
invention are useful in the diagnostic and therapeutic 
methods of the invention. Native proteins of the 

15 invention can be isolated from cells or tissue sources by 
an appropriate purification scheme using standard protein 
purification techniques. Proteins of the invention can 
also be produced by recombinant DNA techniques or 
chemically using standard peptide synthesis techniques. 

2 0 An "isolated" or "purified" protein or 

biologically active portion thereof is substantially free 
of cellular material or other contaminating proteins from 
the cell or tissue source from which the protein is 
derived, or substantially free of chemical precursors or 

25 other chemicals when chemically synthesized. The 
language "substantially free of cellular material" 
includes preparations in which the protein is separated 
from cellular components of the cells from which it is 
isolated or recombinantly produced. Thus, a protein that 

30 is substantially free of cellular material includes 

preparations of the protein having less than about 30%, 
20%, 10%, or 5% (by dry weight) of other proteins (also 
referred to herein as a "contaminating protein") . When 
the protein or biologically active portion thereof is 
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recombinantly produced, it is also preferably 
substantially free of culture medium, i.e., culture 
medium represents less than about 20%, 10%, or 5% of the 
volume of the protein preparation. When the protein is 
5 produced by chemical synthesis, it is preferably 
substantially free of chemical precursors or other 
chemicals, i.e., it is separated from chemical precursors 
or other chemicals which are involved in the synthesis of 
the protein. Accordingly such preparations of the 

10 protein have less than about 30%, 20%, 10%, 5% (by dry 
weight) of chemical precursors or other proteins or 
polypeptides . 

A biologically active portion of a protein of the 
invention can be useful in screening assays. Such 

15 biologically active portions include peptides comprising 
amino acid sequences sufficiently identical to or derived 
from the native amino acid, which include fewer amino 
acids than the full length protein, and exhibit at least 
one activity of the full length protein. 

20 Moreover, other biologically active portions, in 

which other regions of the protein are deleted, can be 
prepared by recombinant techniques and evaluated for one 
or more of the functional activities of the native 
protein. 

25 Useful proteins are proteins which include an 

amino acid sequence at least about 45%, preferably 55%, 
65%, 75%, 85%, 95%, or 99% identical to the amino acid 
sequence of a protein of the invention and retain a 
functional activity of a protein of the invention. 

30 To determine the percent identity of two amino 

acid secjuences or of two nucleic acids, the sequences are 
aligned for optimal comparison purposes (e.g., gaps can 
be introduced in the sequence of a first amino acid or 
nucleic acid sequence for optimal alignment with a second 

35 amino or nucleic acid sequence) . The amino acid residues 
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or nucleotides at corresponding amino acid positions or 
nucleotide positions are then compared. When a position 
in the first sequence is occupied by the same amino acid 
residue or nucleotide as the corresponding position in 
5 the second sequence, then the molecules are identical at 
that position. The percent identity between the two 
sequences is a function of the number of identical 
positions shared by the sequences (i.e., % identity = # 
of identical positions/total # of overlapping positions x 

10 100) . Preferably, the two sequences are the same length. 

The determination of percent homology between two 
sequences can be accomplished using a mathematical 
algorithm. A preferred, non-limiting example of a 
mathematical algorithm utilized for the comparison of two 

15 sequences is the algorithm of Karlin and Altschul (1990) 
Proc. Wat'I Acad. Sci . USA 87:2264-2268, modified as in 
Karlin and Altschul (1993) Proc. Nat'l Acad. Sci. USA 
90:5873-5877. Such an algorithm is incorporated into the 
NBLAST and XBLAST programs of Altschul, et al . (1990) J. 

20 Mol. Biol. 215:403-410. BLAST nucleotide searches can be 
performed with the NBLAST program, score = 100, 
wordlength = 12 to obtain nucleotide sequences homologous 
to HRPCa-9 nucleic acid molecules of the invention. 
BLAST protein searches can be performed with the XBLAST 

25 program, score = 50, wordlength = 3 to obtain amino acid 
sequences homologous to protein molecules of the 
invention. To obtain gapped alignments for comparison 
purposes. Gapped BLAST can be utilized as described in 
Altschul et al., (1997) Nucleic Acids Res. 25:3389-3402. 

3 0 When utilizing BLAST and Gapped BLAST programs, the 
default parameters of the respective programs (e.g., 
XBLAST and NBLAST) can be used. See 

http://www.ncbi.nlm.nih.gov. Another preferred, non- 
limiting example of a mathematical algorithm utilized for 
3 5 the comparison of sequences is the algorithm of Myers and 
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Miller, CABIOS (1989) . Such an algorithm is incorporated 
into the ALIGN program (version 2.0) which is part of the 
GCG sequence alignment software package. When utilizing 
the ALIGN program for comparing amino acid sequences, a 
5 PAM120 weight residue table, a gap length penalty of 12, 
and a gap penalty of 4 can be used. 

The percent identity between two sequences can be 
determined using techniques similar to those described 
above, with or without allovjing gaps. In calculating 

10 percent identity, only exact matches are counted. 

Chimeric or fusion proteins which include all or a 
portion of a protein of the invention are useful in 
screening assays and therapeutically. 

One useful fusion protein is a GST fusion protein 

15 in which all or a portion of a protein of the invention 
is fused to the C-terminus of the GST sequences. 

Immunoglobulin fusion proteins in which all or 
part of a protein of the invention is fused to sequences 
derived from a member of the immunoglobulin protein 

20 family are useful therapeutically. An immunoglobulin 
fusion protein can be incorporated into pharmaceutical 
compositions and administered to a subject to inhibit an 
interaction between a protein of the invention and a 
protein with which it normally interacts. Such inhibition 

25 may be useful therapeutically for the treatment of 
prostate cancer. Moreover, immunoglobulin fusion 
proteins can be used as immunogens to produce antibodies 
in a subject, to purify ligands and in screening assays 
to identify molecules which inhibit the interaction of a 

30 protein of the invention and a protein or other molecule 
with which it normally interacts. Chimeric or fusion 
proteins can be produced by standard techniques (see, 
e.g.. Current Protocols in Molecular Biology, Ausubel et 
al . eds . , John Wiley & Sons: 1992). Moreover, many 

35 expression vectors are commercially available that 



wo 00/18961 



PCTAJS99/22535 



- 26 - 

already encode a fusion moiety (e.g., a GST polypeptide). 
A nucleic acid encoding a polypeptide of interest can be 
cloned into such an expression vector such that the 
fusion moiety is linked in-frame to the polypeptide. 
5 Variants of the proteins of the invention which 

function as either agonists (mimetics) or, preferably, as 
antagonists of a protein of the invention are useful 
therapeutically. Variants can be generated by 
mutagenesis, e.g., discrete point mutation or truncation. 

10 An agonist of a protein can retain substantially the 
same, or a subset, of the biological activities of the 
naturally occurring form of the protein. An antagonist 
of a protein can inhibit one or more of the activities of 
the naturally occurring form of the protein by, for 

15 example, competitively binding to a downstream or 

upstream member of a cellular signaling cascade which 
includes the protein. 

Variants of a protein which function as either an 
agonist (mimetic) or antagonist can be identified by 

20 screening combinatorial libraries of mutants, e.g., 
truncation mutants, for protein agonist or antagonist 
activity. A variegated library of can be generated by 
combinatorial mutagenesis at the nucleic acid level and 
is encoded by a variegated gene library. A variegated 

25 library of variants can be produced by, for example, 
enzymatically ligating a mixture of synthetic 
oligonucleotides into gene sequences such that a 
degenerate set of potential protein sequences is 
expressible as individual polypeptides, or alternatively, 

30 as a set of larger fusion proteins (e.g., for phage 

display) containing the set of variant protein sequences 
therein. There are a variety of methods which can be 
used to produce libraries of potential variants from a 
degenerate oligonucleotide sequence. Chemical synthesis 

35 of a degenerate gene sequence can be performed in an 
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automatic DNA synthesizer, and the synthetic gene then 
ligated into an appropriate expression vector. Use of a 
degenerate set of genes allows for the provision, in one 
mixture, of all of the sequences encoding the desired set 
5 of potential protein sequences. Methods for synthesizing 
degenerate oligonucleotides are known in the art (see, 
e.g., Narang (1983) Tetrahedron 39:3; Itakura et al . 
(1984) Annu. Rev. Biochem. 53:323; Itakura et al . (1984) 
Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 
10 11:477). 

In addition, libraries of fragments of a protein 
coding sequence can be used to generate a variegated 
population of fragments for screening and subsequent 
selection of variants. A library of coding sequence 

15 fragments can be generated by treating a double stranded 
PGR fragment of protein coding sequence with a nuclease 
under conditions wherein nicking occurs only about once 
per molecule, denaturing the double stranded DNA, 
renaturing the DNA to form double stranded DNA which can 

20 include sense/antisense pairs from different nicked 

products, removing single stranded portions from reformed 
duplexes by treatment with SI nuclease, and ligating the 
resulting fragment library into an expression vector. By 
this method, an expression library can be derived which 

25 encodes N-terminal and internal fragments of various 
sizes of the protein of interest. 

Several techniques are known in the art for 
screening gene products of combinatorial libraries made 
by point mutations or truncation, and for screening cDNA 

30 libraries for gene products having a selected property. 
Such techniques are adaptable for rapid screening of the 
gene libraries generated by the combinatorial mutagenesis 
of a protein of the invention. The most widely used 
techniques, which are amenable to high through-put 

35 analysis, for screening large gene libraries typically 
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include cloning the gene library into replicable 
expression vectors, transforming appropriate cells with 
the resulting library of vectors, and expressing the 
combinatorial genes under conditions in which detection 
5 of a desired activity facilitates isolation of the vector 
encoding the gene whose product was detected. Recursive 
ensemble mutagenesis (REM) , a technique which enhances 
the frequency of functional mutants in the libraries, can 
be used in combination with the screening assays to 

10 identify variants (Arkin and Yourvan (1992) Proc. Natl. 
Acad. Sci. USA 89:7811-7815; Delgrave et al . (1993) 
Protein Engrineerinsr 6 (3) : 327-331) . 

An isolated protein of the invention, or a portion 
or fragment thereof, can be used as an immunogen to 

15 generate antibodies that bind to the protein using 
standard techniques for polyclonal and monoclonal 
antibody preparation. Such antibodies are useful 
therapeutically (e.g., for inhibiting protein activity) 
and diagnostically . A full-length protein or an 

20 antigenic fragment thereof can be used as an immunogens. 
Preferred antigenic peptides comprises at least 8 
(preferably 10, 15, 20, or 30) amino acid residues of the 
protein of interest and encompass an epitope of the 
protein such that an antibody raised against the peptide 

25 forms a specific immune complex with the protein. 

An immunogen typically is used to prepare 
antibodies by immunizing a suitable subject, (e.g., 
rabbit, goat, mouse or other mammal) with the immunogen. 
An appropriate immunogenic preparation can contain, for 

30 example, recombinantly expressed protein or a chemically 
synthesized polypeptide. The preparation can further 
include an adjuvant, such as Freund's complete or 
incomplete adjuvant, or similar immunostimulatory agent. 
Immunization of a suitable subject with an immunogenic 

35 preparation induces a polyclonal antibody response. 



wo 00/18961 



PCT/US99/22535 



- 29 - 

The term "antibody" as used herein refers to 
immunoglobulin molecules and immunologically active 
portions of immunoglobulin molecules, i.e., molecules 
that contain an antigen binding site which specifically 
5 binds an antigen, such as HRPCa 9, i.e., binds HRPCa 9, 
but does not substantially bind other molecules in a 
sample, e.g., a biological sample, which contains HRPCa 
9. Examples of immunologically active portions of 
immunoglobulin molecules include F(ab) and F(ab')2 

10 fragments which can be generated by treating the antibody 
with an enzyme such as pepsin. 

Polyclonal antibodies can be prepared as described 
above by immunizing a suitable subject with a suitable 
immunogen. The antibody titer in the immunized subject 

15 can be monitored over time by standard techniques, such 
as with an enzyme linked immunosorbent assay (ELISA) 
using an immobilized form of the protein of interest. If 
desired, the antibody molecules can be isolated from the 
mammal (e.g., from the blood) and further purified by 

20 well-known techniques, such as protein A chromatography 
to obtain the IgG fraction. At an appropriate time after 
immunization, e.g., when the anti- HRPCa-9 antibody 
titers are highest, antibody-producing cells can be 
obtained from the subject and used to prepare monoclonal 

25 antibodies by standard techniques, such as the hybridoma 
technique originally described by Kohler and Milstein 
(1975) Nature 256:495-497, the human B cell hybridoma 
technique (Kozbor et al. (1983) Inmunol Today 4:72), the 
EBV-hybridoma technique (Cole et al . (1985), Monoclonal 

30 Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 
77-96) or trioma techniques. The technology for 
producing various antibodies, monoclonal antibody 
hybridomas is well known (see generally Current Protocols 
in Immunology (1994) Coligan et al . (eds.) John Wiley & 

35 Sons, Inc., New York, NY). Briefly, an immortal cell 
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line (typically a myeloma) is fused to lymphocytes 
(typically splenocytes) from a mammal immunized with the 
immunogen as described above, and the culture 
supernatants of the resulting hybridoma cells are 
5 screened to identify a hybridoma producing a monoclonal 
antibody that binds to the protein of interest. 

Any of the many well known protocols used for 
fusing lymphocytes and immortalized cell lines can be 
applied for the purpose of generating a monoclonal 

10 antibody directed against a polypeptide of the invention 
(see, e.g.. Current Protocols in Immunology, supra; 
Galfre et al . (1977) Nature 266:55052; R.H. Kenneth, in 
Monoclonal Antibodies : A New Dimension In Biological 
Analyses, Plenum Publishing Corp. , New York, New York 

15 (1980); and Lerner (1981) Yale J. Biol. Med., 54:387-402. 
Moreover, the ordinarily skilled worker will appreciate 
that there are many variations of such methods which also 
would be useful. Typically, the immortal cell line 
(e.g., a myeloma cell line) is derived from the same 

20 mammalian species as the lymphocytes. For example, 

murine hybridomas can be made by fusing lymphocytes from 
a mouse immunized with an immunogenic preparation of the 
present invention with an immortalized mouse cell line, 
e.g., a myeloma cell line that is sensitive, to culture 

25 medium containing hypoxanthine , aminopterin and thymidine 
("HAT medium") . Any of a number of myeloma cell lines 
can be used as a fusion partner according to standard 
techniques, e.g., the P3-NSl/l-Ag4-l, P3-x63-Ag8 . 653 or 
Sp2/0-Agl4 myeloma lines. These myeloma lines are 

30 available from ATCC. Typically, HAT-sensitive mouse 
myeloma cells are fused to mouse splenocytes using 
polyethylene glycol ("PEG") . Hybridoma cells resulting 
from the fusion are then selected using HAT medium, which 
kills unfused and unproductively fused myeloma cells 

3 5 (unfused splenocytes die after several days because they 
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are not transformed) . Hybridoma cells producing a 
monoclonal antibody of the invention are detected by 
screening the hybridoma culture supematants for 
antibodies that bind HRPCa-9, e.g., using a standard 
5 ELISA assay. 

Alternative to preparing monoclonal antibody- 
secreting hybridbmas, a monoclonal antibody can be 
identified and isolated by screening a recombinant 
combinatorial immunoglobulin library (e.g., an antibody 

10 phage display library) with the protein of interest to 
thereby isolate immunoglobulin library members that bind 
the protein of interest. Kits for generating and 
screening phage display libraries are commercially 
available (e.g., the Pharmacia Recombinant Phage Antibody 

15 System, Catalog No. 27-9400-01; and the Stratagene 
SurfZAP'" Phage Display Kit, Catalog No. 240612) . 
Additionally, examples of methods and reagents 
particularly amenable for use in generating and screening 
antibody display library can be found in, for example, 

20 U.S. Patent No. 5,223,409; PCT Publication No. WO 
92/18619; PCT Publication No. WO 91/17271; PCT 
Publication No. WO 92/20791; PCT Publication No. WO 
92/15679; PCT Publication No. WO 93/01288; PCT 
Publication No. WO 92/01047; PCT Publication No. VJO 

25 92/09690; PCT Publication No. WO 90/02809; Fuchs et al . 
(1991) Bio/Technology 9:1370-1372; Hay et al . (1992) Hum. 
Antibod. Hybridomas 3:81-85; Huse et al . (1989) Science 
246:1275-1281; Griffiths et al . (1993) EMBO J 12:725-734. 
Also useful in the methods of the invention are 

3 0 recombinant antibodies, such as chimeric and humanized 
monoclonal antibodies, comprising both human and non- 
human portions, which can be made using standard 
recombinant DNA techniques. Such chimeric and humanized 
monoclonal antibodies can be produced by recombinant DNA 

35 techniques known in the art, for example using methods 
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described in PCT Publication No. WO 87/02S71; European 
Patent Application 184,187; European Patent Application 
171,4 96; European Patent Application 173,494; PCT 
Publication No. WO 86/01533; U.S. Patent No. 4,816,567; 
5 European Patent Application 125,023; Better et al . (1988) 
Science 240:1041-1043; Liu et al . (1987) Proc. Natl. 
Acad. Sci. USA 84:3439-3443; Liu et al . (1987) J. 
Immunol. 139:3521-3526; Sun et al . (1987) Proc. Natl. 
Acad. Sci. USA 84:214-218; Nishimura et al . (1987) Cane. 

10 Res. 47:999-1005; Wood et al . (1985) Nature 314:446-449; 
and Shaw et al . (1988) J. Natl. Cancer Inst. 80:1553- 
1559); Morrison, (1985) Science 229:1202-1207; Oi et al . 
(1986) Bio/Technigues 4 :214; U.S. Patent 5,225,539; Jones 
et al. (1986) JVature 321:552-525; Verhoeyan et al. (1988) 

15 Science 239:1534; and Beidler et al . (1988) J. Immunol. 
141 :4053-4060. 

An antibody (e.g., monoclonal antibody) directed 
against a polypeptide of the invention can be used to 
isolate a polypeptide of the invention by standard 

20 techniques, such as affinity chromatography or 

immunoprecipitation and to detect the polypeptide (e.g., 
in a cellular lysate or cell supernatant) in order to 
evaluate the abundance and pattern of expression of the 
polypeptide. Thus, antibodies directed against a 

25 polypeptide of the invention can be used diagnostically 
to monitor protein levels in tissue as part of a clinical 
testing procedure, e.g., to, for example, determine the 
efficacy of a given treatment regimen. Detection can be 
facilitated by coupling the antibody to a detectable 

30 substance. Examples of detectable substances include 
various enzymes, prosthetic groups, fluorescent 
materials, luminescent materials, bioluminescent 
materials, and radioactive materials. Examples of 
suitable enzymes include horseradish peroxidase, alkaline 

3 5 phosphatase, /S-galactosidase, or acetylcholinesterase; 
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examples of suitable prosthetic group complexes include 
streptavidin/biotin and avidin/biotin; examples of 
suitable fluorescent materials include umbellif erone , 
fluorescein, fluorescein isothiocyanate, rhodamine, 
5 dichlorotriazinylamine fluorescein, dansyl chloride or 
phycoerythrin; an example of a luminescent material 
includes luminol; examples of bioluninescent materials 
include luciferase, luciferin, and aequorin, and examples 
of suitable radioactive material include "^I, "^I, or 
10 ^H. 

III. Recombinant Expression Vectors and Host Cells 

Vectors, preferably expression vectors, containing 
a nucleic acid encoding protein of the invention or a 
portion thereof are useful in the methods of the 

15 invention. As used herein, the term "vector" refers to a 
nucleic acid molecule capable of transporting another 
nucleic acid to which it has been linked. One type of 
vector is a "plasmid" , which refers to a circular double 
stranded DNA loop into which additional DNA segments can 

20 be ligated. Another type of vector is a viral vector, 
wherein additional DNA segments can be ligated into the 
viral genome . Certain vectors are capable of autonomous 
replication in a host cell into which they are introduced 
(e.g., bacterial vectors having a bacterial origin of 

25 replication and episomal mammalian vectors) . Other 
vectors (e.g., non-episomal mammalian vectors) are 
integrated into the genome of a host cell upon 
introduction into the host cell, and thereby are 
replicated along with the host genome. Moreover, certain 

30 vectors, expression vectors, are capable of directing the 
expression of genes to which they are operatively linked. 
In general, expression vectors of utility in recombinant 
DNA techniques are often in the form of plasmids 
(vectors) . However, the invention is intended to include 
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such other forms of expression vectors, such as viral 
vectors (e.g., replication defective retroviruses, 
adenoviruses and adeno-associated viruses) , which serve 
equivalent functions. 
5 Useful recombinant expression vectors of the 

invention comprise a nucleic acid of the invention in a 
form suitable for expression of the nucleic acid in a 
host cell, which means that the recombinant expression 
vectors include one or more regulatory sequences, 

10 selected on the basis of the host cells to be used. for 
expression, which is operatively linked to the nucleic 
acid sequence to be expressed. Within a recombinant 
expression vector, "operably linked" is intended to mean 
that the nucleotide sequence of interest is linked to the 

15 regulatory sequence (s) in a manner which allows for 
expression of the nucleotide sequence (e.g., in an in 
vitro transcription/translation system or in a host cell 
when the vector is introduced into the host cell) . The 
term "regulatory sequence" is intended to include 

20 promoters, enhancers and other expression control 
elements (e.g., polyadenylation signals). Such 
regulatory sequences are described, for example, in 
Goeddel; Gene Expression Technology: Methods in 
Enzymology 185, Academic Press, San Diego, CA (1990) . 

25 Regulatory sequences include those which direct 

constitutive expression of a nucleotide sequence in many 
types of host cell and those which direct expression of 
the nucleotide sequence only in certain host cells (e.g., 
tissue-specific regulatory sec[uences) . It will be 

3 0 appreciated by those skilled in the art that the design 
of the expression vector can depend on such factors as 
the choice of the host cell to be transformed, the level 
of expression of protein desired, etc. The expression 
vectors of the invention can be introduced into host 

35 cells to thereby produce proteins or peptides, including 
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fusion proteins or peptides, encoded by nucleic acids as 
described herein (e.g., HRPCa 9 proteins, mutant forms of 
HRPCa 9, fusion proteins, etc.). 

The recombinant expression vectors of the 
5 invention can be designed for expression of a polypeptide 
of the invention in prokaryotic or eukaryotic cells, 
e.g., bacterial cells such as E. coli, insect cells 
(using baculovirus expression vectors) yeast cells or 
mammalian cells. Suitable host cells are discussed 

IQ further in Goeddel, Gene Expression Technology: Methods 
in Enzymology 185, Academic Press, San Diego, CA (1990) . 
Alternatively, the recombinant expression vector can be 
transcribed and translated in vitro, for example using T7 
promoter regulatory sequences and T7 polymerase. 

15 Expression of proteins in prokaryotes is most 

often carried out in E. coli with vectors containing 
constitutive or inducible promoters directing the 
expression of either fusion or non-fusion proteins. 
Fusion vectors add a number of amino acids to a protein 

20 encoded therein, usually to the amino terminus of the 

recombinant protein. Such fusion vectors typically serve 
three purposes: 1) to increase expression of recombinant 
protein; 2) to increase the solubility of the recombinant 
protein; and 3) to aid in the purification of the 

25 recombinant protein by acting as a ligand in affinity 
purification. Often, in fusion expression vectors, a 
proteolytic cleavage site is introduced at the junction 
of the fusion moiety and the recombinant protein to 
enable separation of the recombinant protein from the 

30 fusion moiety subsequent to purification of the fusion 
protein. Such enzymes, and their cognate recognition 
sequences, include Factor Xa, thrombin and enterokinase . 
Typical fusion expression vectors include pGEX (Pharmacia 
Biotech Inc; Smith and Johnson (1988) Gene 67:31-40), 

35 pMAL (New England Biolabs, Beverly, MA) and pRIT5 
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(Pharmacia, Piscataway, NJ) which fuse glutathione S- 
transferase (GST) , maltose E binding protein, or protein 
A, respectively, to the target recombinant protein. 

Examples of suitable inducible non-fusion E. coli 
5 expression vectors include pTrc (Amann et al . , (1988) 
Gene 69:301-315) and pET lid (Studier et al . , Gene 
Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, California (1990) 60-89) . 
Target gene expression from the pTrc vector relies on 

10 host RNA polymerase transcription from a hybrid trp-lac 
fusion promoter. Target gene expression from the pET lid 
vector relies on transcription from a T7 gnlO-lac fusion 
promoter mediated by a coexpressed viral RNA polymerase 
(T7 gnl) . This viral polymerase is supplied by host 

15 strains BL21(DE3) or HMS174(DE3) from a resident X 
prophage harboring a T7 gnl gene under the 
transcriptional control of the lacUV 5 promoter. 

One strategy to maximize recombinant protein 
expression in E. coli is to express the protein in a host 

20 bacteria with an impaired capacity to proteolytically 
cleave the recombinant protein (Gottesman, Gene 
Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, California (1990) 119-128) . 
Another strategy is to alter the nucleic acid sequence of 

25 the nucleic acid to be inserted into an expression vector 
so that the individual codons for each amino acid are 
those preferentially utilized in E. coli (Wada et al . 
(1992) Nucleic Acids Res. 20:2111-2118). Such alteration 
of nucleic acid sequences of the invention can be carried 

3 0 out by standard DNA synthesis techniques. 

Examples of vectors for expression in yeast 
S. cerivisae include pYepSecl (Baldari et al. (1987) 
EMBO J. 5:229-234), pMFa (Kurjan and Herskowitz, (1982) 
Ceil 30:933-943), pJRY88 (Schultz et al . (1987) Gene 
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54:113-123), pYES2 (Invitrogen Corporation, San Diego, 
CA) , and picZ (InVitrogen Corp, San Diego, CA) . 

Baculovirus vectors available for expression of 
proteins in cultured insect cells (e.g., Sf 9 cells) 
5 include the pAc series (Smith et al . (1983) Mol. Cell 
Biol. 3:2156-2165) and the pVL series (Lucklow and 
Summers (1989) Virology 170:31-39). 

Examples of mammalian expression vectors include 
pCDM8 (Seed (1987) Nature 329:840) and pMT2PC (Kaufman et 

10 al . (1987) EMBO J. 6:187-195). When used in mammalian 
cells, the expression vector's control functions are 
often provided by viral regulatory elements. For 
example, commonly used promoters are derived from 
polyoma. Adenovirus 2, cytomegalovirus and Simian Virus 

15 40. For other suitable expression systems for both 

prokaryotic and eukaryotic cells see chapters 16 and 17 
of Sambrook et al . {supra). 

Certain recombinant mammalian expression vector 
are capaible of directing expression of a nucleic acid 

20 preferentially in a particular cell type (e.g., tissue- 
specific regulatory elements are used to express the 
nucleic acid) . Tissue-specific regulatory elements are 
known in the art. Non-limiting examples of suitable 
tissue-specific promoters include the albumin promoter 

25 (liver-specific; Pinkert et al . (1987) Genes Dev. 1:268- 
277) , lymphoid-specif ic promoters (Calame and Eaton 
(1988) Adv. Imnunol. 43:235-275), in particular promoters 
of T cell receptors (Winoto and Baltimore (1989) EMBO J. 
8:729-733) and immunoglobulins (Banerji et al . (1983) 

30 Cell 33:729-740; Queen and Baltimore (1983) Cell 33:741- 
748), neuron-specific promoters (e.g., the neurofilament 
promoter; Byrne and Ruddle (1989) Proc. Natl. Acad. Sci . 
USA 86:5473-5477), pancreas-specif ic promoters (Edlund et 
al. (1985) Science 230:912-916), and mammary gland- 

35 specific promoters (e.g., milk whey promoter; U.S. Patent 
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No. 4,873,316 and European Application Publication No. 
264, 16S). Developmentally-regulated promoters are also 
encompassed, for example the murine hox promoters {Kessel 
and Gruss (1990) Science 249:374-379) and the a- 
5 fetoprotein promoter (Campes and Tilghman (1989) Genes 
Dev. 3:537-546) . 

Recombinant expression vector comprising a nucleic 
acid of the invention cloned into the expression vector 
in an antisense orientation are useful therapeutically. 

10 Regulatory sequences operatively linked to a nucleic acid 
cloned in the antisense orientation can be chosen which 
direct the continuous expression of the antisense RNA 
molecule in a variety of cell types, for instance viral 
promoters and/or enhancers, or regulatory sequences can 

15 be chosen which direct constitutive, tissue specific or 
cell type specific expression of antisense RNA. The 
antisense expression vector can be in the form of a 
recombinant plasmid, phagemid or attenuated virus in 
which antisense nucleic acids are produced under the 

20 control of a high efficiency regulatory region, the 

activity of which can be determined by the cell type into 
which the vector is introduced. For a discussion of the 
regulation of gene expression using antisense genes see 
Weintraub et al. {Reviews - Trends in Genetics, Vol. 1(1) 

25 1986) . 

Host cells into which a recombinant expression 
vector capable of directing expression of a protein if 
the invention has been introduced are useful in the 
screening methods of the invention. 

30 A host cell can be any prokaryotic or eukaryotic 

cell, for example, a protein of the invention can be 
expressed in bacterial cells such as E. coli, insect 
cells, yeast or mammalian cells (such as Chinese hamster 
ovary cells (CHO) or COS cells) . Other suitable host 

35 cells are known to those skilled in the art. 
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Vector DNA can be introduced into prokaryotic or 
eukaryotic cells via conventional transformation or 
transf action techniques, including calcium phosphate or 
calcium chloride co-precipitation, DEAE-dextran-mediated 
5 transf action, lipofection, or electroporation. Suitable 
methods for transforming or transfecting host cells can 
be found in Sambrook, et al. (supra), and other 
laboratory manuals. 

For stable transf ection of mammalian cells, it is 

10 known that, depending upon the expression vector and 
transfection technique used, only a small fraction of 
cells may integrate the foreign DNA into their genome. 
In order to identify and select these integrants, a gene 
that encodes a selecteible marker (e.g., for resistance to 

15 antibiotics) is generally introduced into the host cells 
along with the gene of interest . Preferred selectable 
markers include those which confer resistance to drugs, 
such as G418, hygromycin and methotrexate. Nucleic acid 
encoding a selectable marker can be introduced into a 

20 host cell on the same vector as that encoding HRPCa-9 or 
can be introduced on a separate vector. Cells stably 
transfected with the introduced nucleic acid can be 
identified by drug selection (e.g., cells that have 
incorporated the selectable marker gene will survive, 

25 while the other cells die) . 

A host cell of the invention, such as a 
prokaryotic or eukaryotic host cell in culture, can be 
used to produce (i.e., express) a protein of the 
invention using standard techniques. 

3 0 Certain host cells can be used to produce nonhuman 

transgenic animals. For example, a host cell of the 
invention is a fertilized oocyte or an embryonic stem 
cell into which HRPCa 9-coding sequences have been 
introduced. Such host cells can then be used to create 

35 non-human transgenic animals in which exogenous HRPCa 9 
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homologous recombinant animals in which endogenous HRPCa 
9 sequences have been altered. Such animals are useful 
for studying the fiinction and/or activity of HRPCa 9 and 
5 for identifying and/or evaluating modulators of HRPCa 9 
activity. The transgenic animal is preferably a mammal, 
more preferably a rodent such as a rat or mouse, in which 
one or more of the cells of the animal includes a 
transgene. Other examples of transgenic animals include 

10 non-human primates, sheep, dogs, cows, goats, chickens, 
amphibians, etc. A transgene is exogenous DNA which is 
integrated into the genome of a cell from which a 
transgenic animal develops and which remains in the 
genome of the mature animal, thereby directing the 

15 expression of an encoded gene product in one or more cell 
types or tissues of the transgenic animal. As used 
herein, an "homologous recombinant animal" is a non-human 
animal, preferably a mammal, more preferably a mouse, in 
which an endogenous gene has been altered by homologous 

20 recombination between the endogenous gene and an 

exogenous DNA molecule introduced into a cell of the 
animal, e.g., an embryonic cell of the animal, prior to 
development of the animal . 

A transgenic expressing a protein of the invention 

25 can be created by introducing nucleic acid sequence 

encoding the protein of interest into the male pronuclei 
of a fertilized oocyte, e.g., by microinjection, 
retroviral infection, and allowing the oocyte to develop 
in a pseudopregnant female foster animal . A tissue- 

30 specific regulatory sequence (s) can be operably linked to 
the transgene to direct expression of the protein to 
particular cells. Methods for generating transgenic 
animals via embryo manipulation and microinjection, 
particularly animals such as mice, have become 

35 conventional in the art and are described, for example, 
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in U.S. Patent Nos. 4,736,856 and 4,870,009, U.S. Patent 
No. 4,873,191 and in Hogan, Manipulating the Mouse 
Embryo, (Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, N.Y. , 1986) . Similar methods are used for 
5 production of other transgenic animals. A transgenic 
founder animal can be identified based upon the presence 
of the transgene in its genome and/or expression of mRNA 
produced by the transgene in the cells of the animals. A 
transgenic founder animal can then be used to breed 
10 additional animals carrying the transgene. Moreover, 

transgenic animals carrying a given transgene can further 
be bred to other transgenic animals carrying other 
transgenes . 

To create an homologous recombinant animal, a 

15 vector is prepared which contains at least a portion of a 
gene of interest (e.g., the human HRPCa 9 gene or a 
murine HRPCa 9 gene) into which a deletion, addition or 
substitution has been introduced to thereby alter, e.g., 
functionally disrupt, the gene. The vector can be 

20 designed such that, upon homologous recombination, the 
endogenous gene is functionally disrupted (i.e., no 
longer encodes a functional protein; also referred to as 
a "knock out" vector) . Alternatively, the vector can be 
designed such that, upon homologous recombination, the 

25 endogenous gene is mutated or otherwise altered but still 
encodes functional protein (e.g., the upstream regulatory 
region can be altered to thereby alter the expression of 
the endogenous protein) . In the homologous 
recombination vector, the altered portion of the gene is 

3 0 flanked at its 5' and 3' ends by additional nucleic acid 
of the gene to allow for homologous recombination to 
occur between the exogenous gene carried by the vector 
and an endogenous gene in an embryonic stem cell. The 
additional flanking nucleic acid is of sufficient length 

35 for successful homologous recombination with the 
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endogenous gene. Typically, several kilobases of 
flanking DNA (both at the 5' and 3' ends) are included in 
the vector (see, e.g., Thomas and Capecchi (1987) Cell 
51:503 for a description of homologous recombination 
5 vectors) . The vector is introduced into an embryonic 
stem cell line (e.g., by electroporation) and cells in 
which the introduced gene has homologously recombined 
with the endogenous gene are selected (see, e.g., Li et 
al. (1992) Cell 69:915). The selected cells are then 

10 injected into a blastocyst of an animal (e.g., a mouse) 
to form aggregation chimeras (see, e.g., Bradley in 
Teratocarcinomas and Embryonic Stem Cells: A Practical 
Approach, Robertson, ed. (IRL, Oxford, 1987) pp. 113- 
152) . A chimeric embryo can then be implanted into a 

15 suitable pseudopregnant female foster animal and the 
embryo brought to term. Progeny harboring the 
homologously recombined DNA in their germ cells can be 
used to breed animals in which all cells of the animal 
contain the homologously recombined DNA by germline 

20 transmission of the transgene. Methods for constructing 
homologous recombination vectors and homologous 
recombinant animals are described further in Bradley 
(1991) Current Opinion in Bio/Technology 2:823-829 and in 
PCT Publication Nos. WO 90/11354, VJO 91/01140, WO 

25 92/0968, and WO 93/04169. 

Transgenic non-human animals can be produced which 
contain selected systems which allow for regulated 
expression of the transgene. One example of such a 
system is the cre/loxP recombinase system of 

30 bacteriophage PI. For a description of the cre/loxP 

recombinase system, see, e.g., Lakso et al . (1992) Proc. 
Natl. Acad. Sci. USA 89:6232-6236. Another example of a 
recombinase system is the FLP recombinase system of 
Saccharomyces cerevisiae (0' Gorman et al. (1991) Science 

35 251:1351-1355. If a cre/loxP recombinase system is used 
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to regulate expression of the transgene, animals 
containing transgenes encoding both the Cre recombinase 
and a selected protein are required. Such animals can be 
provided through the construction of "doiible" transgenic 
5 animals, e.g., by mating two transgenic animals, one 
containing a transgene encoding a selected protein and 
the other containing a transgene encoding a recombinase. 

Clones of the non-human transgenic animals 
described herein can also be produced according to the 

10 methods described in Wilmut et al . (1997) Wature 385:810- 
813 and PCT Publication Nos . WO 97/07668 and WO 97/07669. 
In brief, a cell, e.g., a somatic cell, from the 
transgenic animal can be isolated and induced to exit the 
growth cycle and enter phase. The quiescent cell can 

L5 then be fused, e.g., through the use of electrical 
pulses, to an enucleated oocyte from an animal of the 
same species from which the quiescent cell is isolated. 
The reconstructed oocyte is then cultured such that it 
develops to morula or blastocyte and then transferred to 

20 pseudopregnant female foster animal. The offspring borne 
of this female foster animal will be a clone of the 
animal from which the cell, e.g., the somatic cell, is 
isolated. 

IV. Pharmaceutical Compositions 

25 The nucleic acids and polypeptides of the 

invention as well as compounds which are modulators of 
the expression or activity of a protein of the invention 
can be incorporated into pharmaceutical compositions 
suitable for therapeutic or prophylactic use. Such 

30 compositions typically comprise the nucleic acid 
molecule, protein, or other compound and a 
pharmaceutically acceptable carrier. The invention 
includes methods for preparing pharmaceutical 
compositions for modulating the expression or activity of 
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HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, 
HRPCa 13, HRPCa 14, HRPCa 15, or HRPCa 19. Such methods 
comprise formulating a phartnaceutically acceptable 
carrier with an agent which modulates expression or 
5 activity of HRPCa 2, HPRCa 3, HRPCa 6/7. HRPCa 8, HRPCa 
9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, or HRPCa 19. 
Such compositions can further include an ant i -androgenic 
compound. Thus, the invention further includes methods 
for preparing a pharmaceutical composition by formulating 

10 a pharmaceutical ly acceptable carrier with an agent which 
modulates expression or activity of HRPCa 2, HPRCa 3, 
HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 
14, HRPCa 15, or HRPCa 19 and an ant i- androgenic agent. 
As used herein the language "pharmaceutically 

15 acceptable carrier" is intended to include any and all 
solvents, dispersion media, coatings, antibacterial and 
antifungal agents, isotonic and absorption delaying 
agents, and the like, compatible with pharmaceutical 
administration. The use of such media and agents for 

20 pharmaceutically active substances is well known in the 
art . Except insofar as any conventional media or agent 
is incompatible with the active compound, use thereof in 
the compositions is contemplated. Supplementary active 
compounds can also be incorporated into the compositions. 

25 A pharmaceutical composition of the invention is 

formulated to be compatible with its intended route of 
administration. Examples of routes of administration 
include parenteral, e.g., intravenous, intradermal, 
subcutaneous, oral (e.g., inhalation), transdermal 

30 (topical) , transmucosal, and rectal administration. 
Solutions or suspensions used for parenteral, 
intradermal, or subcutaneous application can include the 
following components: a sterile diluent such as water for 
injection, saline solution, fixed oils, polyethylene 
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glycols, glycerine, propylene glycol or other synthetic 
solvents; antibacterial agents such as benzyl alcohol or 
methyl parabens; antioxidants such as ascorbic acid or 
sodium bisulfite; chelating agents such as 
5 ethylenediaminetetraacetic acid; buffers such as 
acetates, citrates or phosphates and agents for the 
adjustment of tonicity such as sodium chloride or 
dextrose. pH can be adjusted with acids or bases, such 
as hydrochloric acid or sodium hydroxide. The parenteral 
10 preparation can be enclosed in ampoules, disposable 

syringes or multiple dose vials made of glass or plastic. 

Pharmaceutical compositions suitable for injection 
use include sterile aqueous solutions (where water 
soluble) or dispersions and sterile powders for the 

15 extemporaneous preparation of sterile injectable 
solutions or dispersions. For intravenous 
administration, suitable carriers include physiological 
saline, bacteriostatic water, Cremophor EL™ (BASF; 
Parsippany, NJ) or phosphate buffered saline (PBS) . In 

20 all cases, the composition must be sterile and should be 
fluid to the extent that easy syringability exists. It 
must be stable under the conditions of manufacture and 
storage and must be preserved against the contaminating 
action of microorganisms such as bacteria and fungi. The 

25 carrier can be a solvent or dispersion medium containing, 
for example, water, ethanol, polyol (for example, 
glycerol, propylene glycol, and liquid polyetheylene 
glycol, and the li)ce) , and suitable mixtures thereof. 
The proper fluidity can be maintained, for example, by 

30 the use of a coating such as lecithin, by the maintenance 
of the required particle size in the case of dispersion 
and by the use of surfactants. Prevention of the action 
of microorganisms can be achieved by various 
antibacterial and antifungal agents, for example. 
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parcibens, chlorobutanol, phenol, ascorbic acid, 
thimerosal, and the like. In many cases, it will be 
preferable to include isotonic agents, for example, 
sugars, polyalcohols such as mannitol, sorbitol, sodium 
5 chloride in the composition. Prolonged absorption of the 
injectable compositions can be brought about by including 
in the composition an agent which delays absorption, for 
example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by 

10 incorporating the active compound (e.g., a HRPCa-9 

protein or anti-HRPCa-9 antibody) in the required amount 
in an appropriate solvent with one or a combination of 
ingredients enumerated above, as required, followed by 
filtered sterilization. Generally, dispersions are 

15 prepared by incorporating the active compound into a 

sterile vehicle which contains a basic dispersion medium 
and the required other ingredients from those enumerated 
above. In the case of sterile powders for the 
preparation of sterile injectable solutions, the 

20 preferred methods of preparation are vacuum drying and 
f reeze-drying which yields a powder of the active 
ingredient plus any additional desired ingredient from a 
previously sterile-filtered solution thereof. 

Oral compositions generally include an inert 

25 diluent or an edible carrier. They can be enclosed in 
gelatin capsules or compressed into tablets. For the 
purpose of oral therapeutic administration, the active 
compound can be incorporated with excipients and used in 
the form of tablets, troches, or capsules. Oral 

30 compositions can also be prepared using a fluid carrier 
for use as a mouthwash, wherein the compound in the fluid 
carrier is applied orally and swished and expectorated or 
swallowed. Pharmaceutically compatible binding agents, 
and/or adjuvant materials can be included as part of the 

35 composition. The tablets, pills, capsules, troches and 
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the like can contain any of the following ingredients, or 
compounds of a similar nature: a binder such as 
microcrystalline cellulose, gum tragacanth or gelatin; an 
excipient such as starch or lactose, a disintegrating 
5 agent such as alginic acid, Primogel, or corn starch; a 
lubricant such as magnesium stearate or Sterotes; a 
glidant such as colloidal silicon dioxide; a sweetening 
agent such as sucrose or saccharin; or a flavoring agent 
such as peppermint, methyl salicylate, or orange 

10 flavoring. For administration by inhalation, the 

compounds are delivered in the form of an aerosol spray 
from a pressurized container or dispenser which contains 
a suitable propellant, e.g., a gas such as carbon 
dioxide, or a nebulizer. 

15 Systemic administration can also be by 

transmucosal or transdermal means . For transmucosal or 
transdermal administration, penetrants appropriate to the 
barrier to be permeated are used in the formulation. 
Such penetrants are generally known in the art, and 

20 include, for example, for transmucosal administration, 
detergents, bile salts, and fusidic acid derivatives. 
Transmucosal administration can be accomplished through 
the use of nasal sprays or suppositories . For 
transdermal administration, the active compounds are 

25 formulated into ointments, salves, gels, or creams as 
generally known in the art . 

The compounds can also be prepared in the form of 
suppositories (e.g., with conventional suppository bases 
such as cocoa butter and other glycerides) or retention 

30 enemas for rectal delivery. 

In one embodiment, the active compounds are 
prepared with carriers that will protect the compound 
against rapid elimination from the body, such as a 
controlled release formulation, including implants and 

35 microencapsulated delivery systems. Biodegradable, 
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biocompatible polymers can be used, such as ethylene 
vinyl acetate, polyanhydrides , polyglycolic acid, 
collagen, polyorthoesters, and polylactic acid. Methods 
for preparation of such formulations will be apparent to 
5 those skilled in the art. The materials can also be 
obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including 
liposomes targeted to infected cells with monoclonal 
antibodies to viral antigens) can also be used as 
10 pharmaceutically acceptable carriers. These can be 

prepared according to methods known to those skilled in 
the art, for example, as described in U.S. Patent No. 
4, 522, 811. 

It is especially advantageous to formulate oral or 

15 parenteral compositions in dosage unit form for ease of 
administration and uniformity of dosage. Dosage unit 
form as used herein refers to physically discrete units 
suited as unitary dosages for the subject to be treated; 
each unit containing a predetermined quantity of active 

20 compound calculated to produce the desired therapeutic 
effect in association with the required pharmaceutical 
carrier. The specification for the dosage unit forms of 
the invention are dictated by and directly dependent on 
the unique characteristics of the active compound and the 

25 particular therapeutic effect to be achieved, and the 
limitations inherent in the art of compounding such an 
active compound for the treatment of individuals. 

The nucleic acid molecules of the invention can be 
inserted into vectors and used as gene therapy vectors. 

30 Gene therapy vectors can be delivered to a subject by, 
for example, intravenous injection, local administration 
(U.S. Patent 5,328,470) or by stereotactic injection 
(see, e.g., Chen et al . (1994) Proc. Natl. Acad. Sci . USA 
91:3054-3057). The pharmaceutical preparation of the 

35 gene therapy vector can include the gene therapy vector 
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in an acceptable diluent, or can comprise a slow release 
matrix in which the gene delivery vehicle is imbedded. 
Alternatively, where the complete gene delivery vector 
can be produced intact from recombinant cells, e.g. 
5 retroviral vectors, the pharmaceutical preparation can 
include one or more cells which produce the gene delivery 
system. 

The pharmaceutical compositions can be included in 
a container, pack, or dispenser together with 
10 instructions for administration. 



V. General Uses and Methods of the Invention 

The nucleic acid molecules, proteins, protein 

homologues, and antibodies described herein can be used 

in one or more of the following methods: a) screening 
15 assays; b) predictive medicine (e.g., diagnostic assays, 

prognostic assays, monitoring clinical trials, and 

phannacogenomics) ; and c) methods of treatment (e.g., 

therapeutic and prophylactic) . 

This invention further pertains to novel agents 
20 identified by the screening assays and uses thereof for 

treatments as described herein. 



A. Screening Assays 

The invention provides methods (also referred to 
herein as "screening assays") for identifying modulators, 

25 i.e., candidate or test compounds or agents (e.g., 
peptides, peptidomimetics, small molecules or other 
drugs) which bind to a protein of the invention or have 
an modulatory effect on the expression or activity of a 
protein of the invention. Modulators which decrease the 

30 expression or activity of a protein of the invention may 
be useful for the treatment of prostate cancer. Such 
modulators may slow the growth of androgen- independent 
(or androgen -dependent) prostate cancer and/or cause an 
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androgen- independent prostate cancer to become androgen- 
dependent . 

In one embodiment, the invention provides assays 
for screening candidate or test compounds which bind to 
5 or modulate the activity of the membrane -bound form of a 
protein or polypeptide of the invention or biologically 
active portion thereof. The test compounds of the 
present invention can be obtained using any of the 
numerous approaches in combinatorial library methods 

10 known in the art, including: biological libraries; 

spatially addressable parallel solid phase or solution 
phase libraries; synthetic library methods requiring 
deconvolution; the "one-bead one-compound" library 
method; and synthetic library methods using affinity 

15 chromatography selection. The biological library 

approach is limited to peptide libraries, while the other 
four approaches are applicable to peptide, non-peptide 
oligomer or small molecule libraries of compounds (Lam 
(1997) Anticancer Drug Des. 12:145). 

20 Examples of methods for the synthesis of molecular 

libraries can be found in the art, for example in: 
DeWitt et al . (1993) Proc. Natl. Acad. Sci . U.S.A. 
90:6909; Erb et al . (1994) Proc. Natl. Acad. Sci. USA 
91:11422; Zuckermann et al . (1994). J. Med. Chem. 

25 37:2678; Cho et al . (1993) Science 261:1303; Carrell et 
al. (1994) Angew. Chem. Int. Ed. Engl. 33:2059; Carell et 
al. (1994) Angew. Chem. Int. Ed. Engl. 33:2061; and 
Gallop et al. (1994) J. Med. Chem. 37:1233. 

Libraries of compounds may be presented in 

30 solution (e.g., Houghten (1992) Bio/Technigues 13:412- 
421), or on beads (Lam (1991) Nature 354:82-84), chips 
(Fodor (1993) Nature 364:555-556) , bacteria (U.S. Patent 
No. 5,223,409), spores (Patent Nos. 5,571,698; 5,403,484; 
and 5,223,409), plasmids (Cull et al . (1992) Proc. Natl. 

35 Acad. Sci. USA 89:1865-1869) or phage (Scott and Smith 
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(1990) Science 249:386-390; Devlin (1990) Science 
249:404-406; Cwirla et al . (1990) Proc. Natl. Acad. Sci. 
87:6378-6382; and Felici (1991) J. Mol . Biol. 222:301- 
310) . 

5 In one embodiment, an assay is a cell -based assay- 

in which a cell which expresses a membrane -bound form of 
a protein of the invention or a biologically active 
portion thereof, on the cell surface is contacted with a 
test compound and the ability of the test compound to 

10 bind to the protein deteirmined. The cell, for example, 
can be a yeast cell or a cell of mammalian origin. 
Determining the ability of the test compound to bind to 
the protein can be accomplished, for example, by coupling 
the test compound with a radioisotope or enzymatic label 

15 such that binding of the test compound to the protein or 
biologically active portion thereof can be determined by 
detecting the labeled compound in a complex. For 
example, test compounds can be labeled with ^^*I , ^^S, "C, 
or ^H, either directly or indirectly, and the radioisotope 

20 detected by direct counting of radioemmission or by 

scintillation counting. Alternatively, test compounds 
can be enzymatically labeled with, for example, 
horseradish peroxidase, alkaline phosphatase, or 
lucif erase, and the enzymatic label detected by 

25 determination of conversion of an appropriate substrate 
to product. In a preferred embodiment, the assay 
comprises contacting a cell which expresses a membrane- 
bound form of a protein of the invention, or a 
biologically active portion thereof, on the cell surface 

30 with a known compound which binds the protein to form an 
assay mixture, contacting the assay mixture with a test 
compound, and determining the ability of the test 
compound to interact with the protein, wherein 
determining the ability of the test compound to interact 

35 with the protein comprises determining the ability of the 
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test compound to preferentially bind to the protein or a 
biologically active portion thereof as compared to the 
known compound. 

In another embodiment, an assay is a cell-based 
5 assay comprising contacting a cell expressing, a protein 
of the invention, or a biologically active portion 
thereof, on the cell surface with a test compound and 
determining the ability of the test compound to modulate 
(e.g., stimulate or inhibit) the activity of the protein 

10 or biologically active portion thereof. Deteinnining the 
ability of the test compound to modulate the activity of 
the protein or a biologically active portion thereof can 
be accomplished, for example, by determining the ability 
of the protein to bind to or interact with a target 

15 molecule of the protein, i.e., a molecule with which the 
protein binds or interacts in nature. 

Determining the ability of a protein of the 
invention to bind to or interact with a molecule can be 
accomplished by one of the methods described above for 

20 determining direct binding. In a preferred embodiment, 
determining the ability of the protein to bind to or 
interact with a target molecule can be accomplished by 
determining the activity of the target molecule. For 
example, the activity of the target molecule can be 

25 determined by detecting induction of a cellular second 
messenger of the target (e.g., intracellular Ca^*, 
diacylglycerol , IP3, etc.), detecting catalytic/enzymatic 
activity of the target on an appropriate siibstrate, 
detecting the induction of a reporter gene (e.g., a 

30 regulatory element responsive to the protein operatively 
linked to a nucleic acid encoding a detectable marker, 
e.g. lucif erase) , or detecting a cellular response, for 
example, cellular differentiation, or cell proliferation. 

In yet another embodiment, an assay of the present 

35 invention is a cell-free assay comprising contacting a 
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protein of the invention or biologically active portion 
thereof with a test compound and determining the ability 
of the test compound to bind to the protein or 
biologically active portion thereof. Binding of the test 
5 compound to the protein can be determined either directly 
or indirectly as described above. In a preferred 
embodiment, the assay includes contacting the protein or 
biologically active portion thereof with a known compound 
which binds the protein to form an assay mixture, 

10 contacting the assay mixture with a test compound, and 
determining the ability of the test compound to interact 
with the protein, wherein determining the ability of the 
test compound to interact with the protein comprises 
determining the ability of the test compound to 

15 preferentially bind to the protein or biologically active 
portion thereof as compared to the known compound. 

In another embodiment, an assay is a cell -free 
assay comprising contacting a protein of the invention or 
biologically active portion thereof with a test compound 

20 and determining the eibility of the test compound to 

modulate (e.g., stimulate or inhibit) the activity of the 
protein or biologically active portion thereof. 
Determining the ability of the test compound to modulate 
the activity of the protein can be accomplished, for 

25 example, by determining the ability of the protein to 

bind to a target molecule by one of the methods described 
above for determining direct binding. In an alternative 
embodiment, determining the ability of the test compound 
to modulate the activity of the protein can be 

30 accomplished by determining the ability of the protein to 
further modulate a target molecule. For example, the 
catalytic/enzymatic activity of the target molecule on an 
appropriate substrate can be determined as previously 
described. 
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In yet another embodiment, the cell -free assay 
comprises contacting a protein of the invention or a 
biologically active portion thereof with a known compound 
which binds to the protein to form an assay mixture, 
5 contacting the assay mixture with a test compound, and 
determining the ability of the test compound to interact 
with the potein, wherein determining the ability of the 
test compound to interact with the protein comprises 
determining the ability of the protein to preferentially 
10 bind to or modulate the activity of a target molecule of 
the protein. 

The cell -free assays of the present invention are 
amenable to use of both the soluble form or the membrane - 
bound form of a protein of the invention. In the case of 

15 cell-free assays comprising the membrane -bound form of a 
protein of the invention, it may be desirable to utilize 
a solubilizing agent such that the membrane - bound form of 
the protein is maintained in solution. Examples of such 
solubilizing agents include non- ionic detergents such as 

20 n-octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, 
octanoyl-N-methylglucamide, decanoyl-N-methylglucamide, 
Triton® X-100, Triton® X-114, Thesit®, 
Isotridecypoly (ethylene glycol ether)n, 3-[(3- 
cholamidopropyl) dimethylamminio] -1 -propane sulfonate 

25 (CHAPS), 3- [ (3 -cholamidopropyl) dimethylamminio] -2- 

hydroxy-1 -propane sulfonate (CHAPSO) , or N-dodecyl=N, N- 
dimethyl - 3 -ammonio- 1 -propane sulfonate . 

In more than one embodiment of the above assay 
methods of the present invention, it may be desirable to 

30 immobilize either a protein of the invention or its 
target molecule to facilitate separation of complexed 
from uncomplexed forms of one or both of the proteins, as 
well as to accommodate automation of the assay. Binding 
of a test compound to a protein of the invention, or the 

35 protein with a target molecule in the presence and 
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absence of a candidate compound, can be accomplished in 
any vessel suitable for containing the reactants. 
Examples of such vessels include microtitre plates, test 
tubes, and micro-centrifuge tubes. In one embodiment, a 
5 fusion protein can be provided which adds a domain that 
allows one or both of the proteins to be bound to a 
matrix. For example, glutathione-S-transf erase/protein 
of the invention fusion proteins or glutathione-S- 
transf erase/target fusion proteins can be adsorbed onto 

10 glutathione sepharose beads (Sigma Chemical; St. Louis, 
MO) or glutathione derivatized microtitre plates, which 
are then combined with the test compound or the test 
compound and either the non-adsorbed target protein or a 
protein of the invention, and the mixture incubated under 

15 conditions conducive to complex formation (e.g., at 
physiological conditions for salt and pH) . Following 
incubation, the beads or microtitre plate wells are 
washed to remove any unbound components and complex 
formation is measured either directly or indirectly, for 

20 example, as described above. Alternatively, the 

complexes can be dissociated from the matrix, and the 
level of binding or activity of the protein of the 
invention can be determined using standard techniques. 

Other techniques for immobilizing proteins on 

25 matrices can also be used in the screening assays of the 
invention. For example, either a protein of the 
invention or its target molecule can be immobilized 
utilizing conjugation of biotin and streptavidin. 
Biotinylated molecules can be prepared from biotin-NHS 

30 (N-hydroxy-succinimide) using techniques well known in 
the art (e.g., biotinylation kit. Pierce Chemicals; 
Rockford, IL) , and immobilized in the wells of 
streptavidin-coated 96 well plates (Pierce Chemical) . 
Alternatively, antibodies reactive with a protein of the 

35 invention or a target molecule but which do not interfere 
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with binding of the protein of the invention to its 
target molecule can be derivatized to the wells of the 
plate, and unbound target or protein trapped in the wells 
by antibody conjugation. Methods for detecting such 
5 complexes, in addition to those described above for the 
GST- immobilized complexes, include immunodetection of 
complexes using antibodies reactive with the protein of 
the invention or target molecule, as well as enzyme- 
linked assays which rely on detecting an enzymatic 

10 activity associated with a protein of the invention or 
its target molecule. 

In another embodiment, modulators of expression of 
a protein of the invention are identified in a method in 
which a cell is contacted with a candidate compound and 

15 the expression of the mRNA or protein of the invention in 
the cell is determined. The level of expression of mPiNA 
or protein of the invention in the presence of the 
candidate compoxind is compared to the level of expression 
in the absence of the candidate compound. The candidate 

20 compound can then be identified as a modulator of 

expression based on this comparison. For example, when 
expression of the mRNA or protein of the invention is 
greater (statistically significantly greater) in the 
presence of the candidate compound than in its absence, 

25 the candidate compound is identified as a stimulator of 
mRNA or protein expression. Alternatively, when 
expression of the mRNA or protein of the invention is 
less (statistically significantly less) in the presence 
of the candidate compound than in its absence, the 

30 candidate compound is identified as an inhibitor of mRNA 
or protein expression. The level of expression of an 
mRNA or protein of the invention can be determined by 
methods described herein. 

In yet another aspect of the invention, a protein 

35 of the invention can be used as "bait proteins" in a two- 
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hybrid assay or three hybrid assay (see, e.g., U.S. 
Patent No. 5,283,317; Zervos et al. (1993) Cell 72:223- 
232; Madura et al . (1993) J. Biol. Chem. 268:12046-12054; 
Bartel et al . (1993) Bio/Techniques 14:920-924; Iwabuchi 
5 et al . (1993) Oncogene 8:1693-1696; and PCT Publication 
No. WO 94/10300) , to identify other proteins, which bind 
to or interact with the protein and modulate activity of 
the protein. 

In another embodiment, modulators of activity of a 

10 protein of the invention are identified in a method in 
which a cell is contacted with a candidate compound and 
the activity of a protein of the invention in the cell is 
determined. The level of activity of the protein of the 
invention in the presence of the candidate compound is 

15 compared to the level of expression in the absence of the 
candidate compound. The candidate compound can then be 
identified as a modulator of activity based on this 
comparison. For example, when the activity of a protein 
of the invention is greater (statistically significantly 

20 greater) in the presence of the candidate compound than 
in its absence, the candidate compound is identified as a 
stimulator protein activity. Alternatively, when the 
activity of a protein of the invention is less 
(statistically significantly less) in the presence of the 

25 candidate compound than in its absence, the candidate 
compound is identified as an inhibitor of protein 
activity. Such inhibitors are useful for the treatment 
of androgen- dependent and androgen- independent prostate 
cancers. The level of activity of a protein of the 

30 invention can be determined by an method used to measure 
activity of the protein. 

This invention further pertains to novel agents 
identified by the above-described screening assays and 
uses thereof for treatments as described herein. 
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B . Predictive Medicine 

The present invention also pertains to the field 
of predictive medicine in which diagnostic assays, 
prognostic assays, pharmacogenomics, and monitoring 
5 clinical trails are used for prognostic (predictive) 

purposes to thereby treat an individual prophylactically . 
Accordingly, one aspect of the present invention relates 
to diagnostic and prognostic (or predictive) assays for 
determining the expression or activity of a protein of 

10 invention, in the context of a biological sample (e.g., 
prostate cells or tissue) to thereby determine whether an 
individual is afflicted with prostate cancer, or is at 
risk of developing prostate cancer (or another cancer) . 
The methods are particularly useful for determining 

15 whether an individual is at risk for developing an 

androgen- independent prostate cancer. Such assays can be 
used for prognostic or predictive purpose to thereby 
prophylactically treat an individual prior to the onset 
of prostate cancer. 

20 Another aspect of the invention provides methods 

for determining the expression of a nucleic acid or 
protein of the invention or the activity of a protein of 
the invention in an individual to thereby select 
appropriate therapeutic or prophylactic agents for that 

25 individual (referred to herein as "pharmacogenomics"). 
Pharmacogenomics allows for the selection of agents 
(e.g., drugs) for therapeutic or prophylactic treatment 
of an individual based on the genotype of the individual 
(e.g., the genotype of the individual examined to 

30 determine the ability of the individual to respond to a 
particular agent.) 

Yet another aspect of the invention pertains to 
monitoring the influence of agents (e.g., drugs or other 
compounds) on the expression of a nucleic acid or protein 
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invention in clinical trials. 

These and other agents are described in further 
detail in the following sections. 

5 1 . Diagnostic Assays 

An exemplary method for detecting the presence or 
absence of a protein or nucleic acid of the invention in 
biological sample involves obtaining a biological sample 
from a test subject and contacting the biological sample 

10 with a compound or an agent capable of detecting the 
protein or nucleic acid (e.g., mRNA, genomic DNA) such 
that the presence of the protein or nucleic acid is 
detected in the biological sample. A preferred agent for 
detecting mRNA or genomic DNA is a labeled nucleic acid 

15 probe capable of hybridizing to the selected mRNA or 

genomic DNA. The nucleic acid probe can be, for example, 
a full-length nucleic acid of the invention or a portion 
thereof, such as an oligonucleotide of at least 15, 30, 
50, 100, 250 or 500 nucleotides in length and sufficient 

20 to specifically hybridize under stringent conditions to 
the selected mRNA or genomic DNA. Other suitable probes 
for use in the diagnostic assays of the invention are 
described herein. 

A preferred agent for detecting a protein of the 

25 invention is an antibody capable of binding to the 

protein, preferably an antibody with a detectable label. 
Antibodies can be polyclonal, or more preferably, 
monoclonal. An intact antibody, or a fragment thereof 
(e.g.. Fab or F(ab')2) can be used. The term "labeled," 

30 with regard to the probe or antibody, is intended to 
encompass direct labeling of the probe or antibody by 
coupling (i.e., physically linking) a detectable 
substance to the probe or antibody, as well as indirect 
labeling of the probe or antibody by reactivity with 
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another reagent that is directly labeled. Examples of 
indirect labeling include detection of a primary antibody 
using a f luorescently labeled secondary antibody and end- 
labeling of a DNA probe with biotin such that it caxi be 
5 detected with f luorescently labeled streptavidin . The 
term "biological sample" is intended to include tissues, 
cells (e.g., prostate cells) and biological fluids 
isolated from a subject, as well as tissues, cells and 
fluids present within a subject. That is, the detection 

10 method of the invention can be used to detect a nucleic 
acid or protein of the invention in a biological sample 
in vitro as well as in vivo. For example, in vitro 
techniques for detection of mRNA include Northern 
hybridizations and in situ hybridizations. In vitro 

15 techniques for detection of protein include enzyme 
linked immunosorbent assays (ELISAs) , Western blots, 
immunoprecipi tat ions and immunofluorescence. In vitro 
techniques for detection of genomic DNA include Southern 
hybridizations. Furthermore, in vivo techniques for 

20 detection of protein include introducing into a subject 
a labeled antibody directed against a protein of the 
invention. For example, the antibody can be labeled with 
a radioactive marker whose presence and location in a 
subject can be detected by standard imaging techniques. 

25 In one embodiment, the biological sample contains 

protein molecules from the test subject. Alternatively, 
the biological sample can contain mRNA molecules from the 
test subject or genomic DNA molecules from the test 
subject. A preferred biological sample is a peripheral 

30 blood leukocyte sample isolated by conventional means 
from a subject. 

In another embodiment, the methods further involve 
obtaining a control biological sample from a control 
subject, contacting the control sample with a compound or 

35 agent capable of detecting a protein or nucleic acid of 
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the invention, such that the presence of the protein or 
nucleic acid is detected in the biological sample, and 
comparing the presence of the protein or nucleic acid of 
the invention in the control sample with the presence of 
5 the protein or nucleic acid of the invention in the test 
sample . 

The invention also encompasses kits for detecting 
the presence of a protein or nucleic acid of the 
invention in a biological sample (a test sample) . Such 

10 kits can be used to determine if a subject is suffering 
from or is at increased risk of developing prostate 
cancer (e.g., an androgen- independent prostate cancer). 
For example, the kit can comprise a labeled compound or 
agent capable of detecting a protein or nucleic acid of 

15 the invention in a biological sample and means for 

determining the amount of the protein or nucleic acid of 
the invention in the sample (e.g., an antibody or an 
oligonucleotide probe) . Kits may also include 
instruction for observing that the tested subject is 

20 suffering from or is at risk of developing a prostate 
cancer if the amoiint of the protein or nucleic acid of 
the invention is above a normal level or is 
constitutively expressed in the absence of testosterone 
or another androgen. 

25 For antibody-based kits, the kit may comprise, for 

example: (1) a first antibody (e.g., attached to a solid 
support) which binds to the protein of the invention; 
and, optionally, (2) a second, different antibody which 
binds to the protein or the first antibody and is 

30 conjugated to a detectable agent. 

For oligonucleotide-based kits, the kit may 
comprise, for example: (1) an oligonucleotide, e.g., a 
detectably labeled oligonucleotide, which hybridizes to a 
nucleic acid of the invention or (2) a pair of primers 

35 useful for amplifying a nucleic acid of the invention. 
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The kit may also comprise, e.g., a buffering 
agent, a preservative, or a protein stabilizing agent. 
The kit may also comprise components necessary for 
detecting the detectable agent (e.g., an enzyme or a 
5 substrate) . The kit may also contain a control sample or 
a series of control samples which can be assayed and 
compared to the test sample contained. Each component of 
the kit is usually enclosed within an individual 
container and all of the various containers are within a 
10 single package along with instructions for observing 
whether the tested subject is suffering from or is at 
risk of developing prostate cancer (e.g., androgen- 
independent prostate cancer) . 

2 . Prognostic Assays 

15 The methods described herein can furthermore be 

utilized as diagnostic or prognostic assays to identify 
subjects having or at risk of developing a disease or 
disorder associated with aberrant expression or activity 
of HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 

2 0 10, HRPCa 13, HRPCa 14, HRPCa 15, or HRPCa 19. For 
example, the assays described herein, such as the 
preceding diagnostic assays or the following assays, can 
be utilized to identify a subject having or at risk of 
developing a disorder associated with HRPCa 2, HPRCa 3, 

25 HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 
14, HRPCa 15, or HRPCa 19 protein, nucleic acid 
expression (e.g., constitutive expression) or activity, 
e.g., androgen- independent prostate cancer. 
Alternatively, the prognostic assays can be utilized to 

30 identify a subject having or at risk for developing such 
a disease or disorder. Thus, the present invention 
provides a method in which a test sample (e.g., prostate 
cells) is obtained from a subject and a protein or 
nucleic acid (e.g., mRNA, genomic DNA) of the invention 
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is detected, wherein the presence of increased or 
constitutive expression of protein or nucleic acid of the 
invention is diagnostic for a subject having or at risk 
of developing prostate cancer (e.g., androgen- independent 
5 prostate cancer) . As used herein, a "test sample" refers 
to a biological sample obtained from a subject of 
interest. For example, a test sample can be a biological 
fluid (e.g., serum), cell sample, or tissue. 

Furthermore, the prognostic assays described 

10 herein can be used to determine whether a subject can be 
administered an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small 
molecule, or other drug candidate) prostate cancer. For 
example, such methods can be used to determine whether a 

15 subject can be effectively treated with a specific agent 
or class of agents (e.g., agents of a type which decrease 
activity of a protein of the invention or agents of a 
type which decrease expression of a protein or nucleic 
acid of the invention) . Thus, the present invention 

2 0 provides methods for determining whether a subject can be 
effectively treated with an agent for a disorder (e.g., 
prostate cancer) associated with aberrant expression 
(e.g., constitutive or non-androgen inducible expression) 
or activity of HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, 

25 HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, or HRPCa 
19 in which a test sample is obtained and HRPCa 2, HPRCa 
3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 
14, HRPCa 15, or HRPCa 19 protein or nucleic acid is 
detected. 

30 The methods of the invention can also be used to 

detect genetic lesions or mutations in a gene of the 
invention, thereby determining if a subject with the 
lesioned gene is at risk for a disorder associated with 
such a lesion. In preferred embodiments, the methods 

35 include detecting, in a sample of cells from the subject. 
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the presence or absence of a genetic lesion or mutation 
characterized by at least one of an alteration affecting 
the integrity of a gene of the invention, or the mis- 
expression of a protein or nucleic acid of the invention. 
5 For example, such genetic lesions can be detected by 
ascertaining the existence of at least one of: 1) a 
deletion of one or more nucleotides from a gene of the 
invention; 2) an addition of one or more nucleotides to a 
gene of the invention; 3) a substitution of one or more 

10 nucleotides of a gene of the invention; 4) a chromosomal 
rearrangement of a gene of the invention; 5) an 
alteration in the level of a messenger RNA transcript of 
a gene of the invention, 6) aberrant modification of a 
gene of the invention, such as of the methylation pattern 

15 of the genomic DNA, 7) the presence of a non-wild type 
splicing pattern of a messenger RNA transcript of a gene 
of the invention, 8) a non-wild type level of a protein 
of the invnetion, 9) allelic loss of a gene of the 
invention, 10) inappropriate post-translational 

20 modification of a protein of the invention, 11) loss of 
androgen- inducibility of a gene of the invention. As 
described herein, there are a large number of assay 
techniques known in the art which can be used for 
detecting lesions in a gene of the invention. A 

25 preferred biological sample is a peripheral blood 

leukocyte sample isolated by conventional means from a 
subject . 

In certain embodiments, detection of the lesion 
involves the use of a probe/primer in a polymerase chain 

30 reaction (PGR) (see, e.g., U.S. Patent Nos. 4,683,195 and 
4,683,202), such as cinchor PGR or RACE PGR, or, 
alternatively, in a ligation chain reaction (LCR) (see, 
e.g., Landegran et al. (1988) Science 241:1077-1080; and 
Nakazawa et al. (1994) Proc. Natl. Acad. Sci. USA 91:360- 

35 3 64) , the latter of which can be particularly useful for 
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detecting point mutations in the gene (see, e.g., 
Abravaya et al . (1995) Nucleic Acids Res. 23:675-682). 
This method can include the steps of collecting a sample 
of cells from a patient, isolating nucleic acid (e.g., 
5 genomic, mRNA or both) from the cells of the sample, 
contacting the nucleic acid sample with one or more 
primers which specifically hybridize to a gene of the 
invention \mder conditions such that hybridization and 
amplification of the gene (if present) occurs, and 

10 detecting the presence or absence of an amplification 
product, or detecting the size of the amplification 
product and comparing the length to a control sample. It 
is anticipated that PGR and/or LCR may be desirable to 
use as a preliminary amplification step in conjunction 

15 with any of the techniques used for detecting mutations 
described herein. 

Alternative amplification methods include: self 
sustained sequence replication (Guatelli et al . (1990) 
Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional 

20 amplification system (Kwoh, et al . (1989) Proc. Natl. 
Acad. Sci. USA 86:1173-1177), Q-Beta Replicase (Lizardi 
et al . (1988) Bio/Technology 6:1197), or any other 
nucleic acid amplification method, followed by the 
detection of the amplified molecules using techniques 

25 well )cnown to those of s)cill in the art. These detection 
schemes are especially useful for the detection of 
nucleic acid molecules if such molecules are present in 
very low numbers . 

In an alternative embodiment, mutations in a gene 

30 of the invention from a sample cell can be identified by 
alterations in restriction enzyme cleavage patterns. For 
example, sample and control DNA is isolated, amplified 
(optionally) , digested with one or more restriction 
endonucleases, and fragment length sizes are determined 

35 by gel electrophoresis and compared. Differences in 
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fragment length sizes between sample and control DNA 
indicates mutations in the sample DNA. Moreover, the use 
of sequence specific ribozymes (see, e.g., U.S. Patent 
No. 5,4 98,531) can be used to score for the presence of 
5 specific mutations by development or loss of a ribozyme 
cleavage site. 

In other embodiments, genetic mutations in a gene 
of the invention can be identified by hybridizing a 
sample and control nucleic acids, e.g., DNA or RNA, to 

10 high density arrays containing hundreds or thousands of 
oligonucleotides probes (Cronin et al . (199S) Human 
Mutation 7:244-255; Kozal et al . (1996) Wature Medicine 
2:753-759). For example, genetic mutations in a gene of 
the invention can be identified in two-dimensional arrays 

15 containing light -generated DNA probes as described in 
Cronin et al . supra. Briefly, a first hybridization 
array of probes can be used to scan through long 
stretches of DNA in a sample and control to identify base 
changes between the sequences by making linear arrays of 

20 sequential overlapping probes. This step allows the 

identification of point mutations. This step is followed 
by a second hybridization array that allows the 
characterization of specific mutations by using smaller, 
specialized probe arrays complementary to all variants or 

25 mutations detected. Each mutation array is composed of 
parallel probe sets, one complementary to the wild- type 
gene and the other complementary to the mutant gene . 

In yet another embodiment, any of a variety of 
sequencing reactions known in the art can be used to 

3 0 directly sequence the Gene of the invention and detect 
mutations by comparing the sequence of the sample with 
the corresponding wild- type (control) sequence. Examples 
of sequencing reactions include those based on techniques 
developed by Maxim and Gilbert ((1977) Proc. Natl. Acad. 

35 Sci. USA 74:560) or Sanger ((1977) Proc. Natl. Acad. 
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Sci . USA 74:5463). It is also contemplated that any of a 
variety of automated sequencing procedures can be 
utilized when perfoinning the diagnostic assays ((1995) 
Bio/Techniques 19:448) , including sequencing by mass 
5 spectrometry (see, e.g., PCT Publication No. WO 94/16101; 
Cohen et al . (1996) Adv. Chromatogr. 36:127-162; and 
Griffin et al . (1993) Appl . Biochem. Biotechnol . 38:147- 
159) . 

Other methods for detecting mutations in the gene 

10 of the invention include methods in which protection from 
cleavage agents is used to detect mismatched bases in 
RNA/RNA or RNA/DNA heteroduplexes (Myers et al . (1985) 
Science 230:1242). In general, the technique of 
"mismatch cleavage" entails providing heteroduplexes 

15 formed by hybridizing (labeled) RNA or DNA containing the 
wild-type the sequence with potentially mutant RNA or DNA 
obtained from a tissue sample. The double -stranded 
duplexes are treated with an agent which cleaves single- 
stranded regions of the duplex such as which will exist 

20 due to basepair mismatches between the control and sample 
strands. RNA/DNA duplexes can be treated with RNase to 
digest mismatched regions, and DNA/DNA hybrids can be 
treated with SI nuclease to digest mismatched regions. 
In other embodiments, either DNA/DNA or RNA/DNA duplexes 

25 can be treated with hydroxylamine or osmium tetroxide and 
with piperidine in order to digest mismatched regions. 
After digestion of the mismatched regions, the resulting 
material is then separated by size on denaturing 
polyacryl amide gels to determine the site of mutation. 

30 See, e.g.. Cotton et al (1988) Proc . Natl. Acad. Sci. USA 
85:4397; Saleeba et al (1992) Methods Enzymol. 217:286- 
295. In a preferred embodiment, the control DNA or RNA 
can be labeled for detection. 

In still another embodiment, the mismatch cleavage 

35 reaction employs one or more proteins that recognize 
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mismatched base pairs in double -stranded DNA (so called 
"DNA mismatch repair" enzymes) in defined systems for 
detecting and mapping point mutations in a cDNA obtained 
from samples of cells. For example, the mutY enzyme of 
5 E. coli cleaves A at G/A mismatches and the thymidine DNA 
glycosylase from HeLa cells cleaves T at G/T mismatches 
(Hsu et al. (1994) Carcinogenesis 15:1657-1662). 
According to an exemplary embodiment, a probe based on 
the sequence of a gene of the invention, is hybridized to 

10 a cDNA or other DNA product from a test cell(s) . The 

duplex is treated with a DNA mismatch repair enzyme, and 
the cleavage products, if any, can be detected from 
electrophoresis protocols or the like. See, e.g., U.S. 
Patent No. 5,459,039. 

15 In other embodiments, alterations in 

electrophoretic mobility will be used to identify 
mutations in a gene of the invention For example, single 
strand conformation polymorphism (SSCP) may be used to 
detect differences in electrophoretic mobility between 

20 mutant and wild type nucleic acids (Orita et al. (1989) 
Proc. Natl. Acad. Sci . USA 86:2766; see also Cotton 
(1993) Mutat. J?es. 285:125-144; Hayashi (1992) Genet Anal 
Tech Appl 9:73-79). Single- stranded DNA fragments of 
sample and control nucleic acids will be denatured and 

25 allowed to renature. The secondary structure of single- 
stranded nucleic acids varies according to sequence, and 
the resulting alteration in electrophoretic mobility 
enables the detection of even a single base change. The 
DNA fragments may be labeled or detected with labeled 

30 probes. The sensitivity of the assay may be enhanced by 
using RNA (rather than DNA) , in which the secondary 
structure is more sensitive to a change in sequence. In 
a preferred embodiment, the subject method utilizes 
heteroduplex analysis to separate double stranded 

35 heteroduplex molecules on the basis of changes in 
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electrophoretic mobility (Keen et al . (1991) Trends Genet 
7:5) . 

In yet another embodiment, the movement of mutant 
or wild- type fragments in polyacrylamide gels containing 
5 a gradient of denaturant is assayed using denaturing 
gradient gel electrophoresis (DGGE) (Myers et al . (1985) 
Wature 313:495) . VJhen DGGE is used as the method of 
analysis, DNA will be modified to insure that it does not 
completely denature, for example by adding a GC clamp of 

10 approximately 40 bp of high-melting GC-rich DNA by PGR. 
In a further embodiment, a temperature gradient is used 
in place of a denaturing gradient to identify differences 
in the mobility of control and sample DNA (Rosenbaum and 
Reissner (1987) Biophys Chem 255:12753). 

15 Examples of other techniques for detecting point 

mutations include, but are not limited to, selective 
oligonucleotide hybridization, selective amplification, 
or selective primer extension. For example, 
oligonucleotide primers may be prepared in which the 

20 known mutation is placed centrally and then hybridized to 
target DNA under conditions which permit hybridization 
only if a perfect match is found (Saiki et al . (1986) 
Wature 324:163); Saiki et al . (1989) Proc . Natl Acad. Sci 
USA 86:6230). Such allele specific oligonucleotides are 

25 hybridized to PGR amplified target DNA or a number of 
different mutations when the oligonucleotides are 
attached to the hybridizing membrane and hybridized with 
labeled target DNA. 

Alternatively, allele specific amplification 

3 0 technology which depends on selective PGR amplification 
may be used in conjunction with the instant invention. 
Oligonucleotides used as primers for specific 
amplification may carry the mutation of interest in the 
center of the molecule (so that amplification depends on 

35 differential hybridization) (Gibbs et al. (1989) Nucleic 
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Acids Res. 17:2437-2448) or at the extreme 3' end of one 
primer where, under appropriate conditions, mismatch can 
prevent or reduce polymerase extension (Prossner (1993) 
rijbtech 11:238). In addition, it may be desirable to 
5 introduce a novel restriction site in the region of the 
mutation to create cleavage -based detection (Gasparini et 
al. (1992) Mol. Cell Probes 6:1). It is anticipated that 
in certain embodiments amplification may also be 
performed using Taq ligase for amplification (Barany 

10 (1991) Proc. Natl. Acad. Sci USA 88:189). In such cases, 
ligation will occur only if there is a perfect match at 
the 3' end of the 5' sequence making it possible to 
detect the presence of a known mutation at a specific 
site by looking for the presence or absence of 

15 amplification. 

The methods described herein may be performed, for 
example, by utilizing pre-packaged diagnostic kits 
comprising at least one probe nucleic acid or antibody 
reagent described herein, which may be conveniently used, 

20 e.g., in clinical settings to diagnose patients 

exhibiting symptoms or family history of a disease or 
illness involving a gene of the invention, e.g., prostate 
cancer. 

Furthermore, any cell type or tissue, preferably 
25 prostate cells, in which HRPCa 2, HPRCa 3, HRPCa 6/7, 

HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, 
or HRPCa 19 is expressed may be utilized in the 
prognostic assays described herein. 

30 3 . Pharmacoqenomics 

Agents, or modulators which have an inhibitory 
effect on the expression of a nucleic acid or protein of 
the invention or the activity of a protein of the 
invention as identified by a screening assay described 

35 herein can be administered to individuals to treat 
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(prophylactically or therapeutically) prostate cancer. 
In conjunction with such treatment, the pharmacogenomics 
(i.e., the study of the relationship between an 
individual ' s genotype and that individual ' s response to a 
5 foreign compound or drug) of the individual may be 
considered. Differences in metabolism of therapeutics 
can lead to severe toxicity or therapeutic failure by 
altering the relation between dose and blood 
concentration of the pharmacologically active drug. Thus, 

10 the pharmacogenomics of the individual permits the 
selection of effective agents (e.g., drugs) for 
prophylactic or therapeutic treatments based on a 
consideration of the individual's genotype. Such 
pharmacogenomics can further be used to determine 

15 appropriate dosages and therapeutic regimens. 

Accordingly, the activity of a protein of the invention 
or the expression of a nucleic acid or protein of the 
invention or the mutation content of a gen of the 
invention in an individual can be determined to thereby 

20 select appropriate agent (s) for therapeutic or 
prophylactic treatment of the individual. 

Pharmacogenomics deals with clinically significant 
hereditary variations in the response to drugs due to 
altered drug disposition and abnormal action in affected 

25 persons. See, e.g., Linder (1997) Clin. Chem. 

43 (2) : 254-266 . In general, two types of pharmacogenetic 
conditions can be differentiated. Genetic conditions 
transmitted as a single factor altering the way drugs act 
on the body are referred to as "altered drug action." 

30 Genetic conditions transmitted as single factors altering 
the way the body acts on drugs are referred to as 
"altered drug metabolism." These pharmacogenetic 
conditions can occur either as rare defects or as 
polymorphisms . 
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Thus, the activity of a protein of the invention 
or the expression of a nucleic acid or protein of the 
invention or the mutation content of a gene of the 
invention in an individual can be determined to thereby 
5 select appropriate agent (s) for therapeutic or 

prophylactic treatment of the individual. In addition, 
pharmacogenetic studies can be used to apply genotyping 
of polymorphic alleles encoding drug-metabolizing enzymes 
to the identification of an individual's drug 

10 responsiveness phenotype. This knowledge, when applied 
to dosing or drug selection, can avoid adverse reactions 
or therapeutic failure and thus enhance therapeutic or 
prophylactic efficiency when treating a subject with a 
modulator of the expression or activity of HRPCa 2, 

15 HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, 
HRPCa 14, HRPCa 15, or HRPCa 19 such as a modulator 
identified by one of the exemplary screening assays 
described herein. 

4 . Monitoring of Effects During Clinical Trials 
20 Monitoring the influence of agents (e.g., drugs, 

compounds) on the expression or activity of HRPCa 2, 
HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, 
HRPCa 14, HRPCa 15, or HRPCa 19 can be applied not only 
in basic drug screening, but also in clinical trials. 
25 For example, the effectiveness of an agent, as determined 
by a screening assay, to decrease expression or activity 
of HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 
10, HRPCa 13, HRPCa 14, HRPCa 15, or HRPCa 19, can be 
monitored in clinical trials of subjects exhibiting 
30 increased expression (or constitutive expression) or 

increase activity of HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 
8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, or 
HRPCa 19. 
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For example, in a clinical trial, cells can be 
isolated and RNA prepared and analyzed for the levels of 
expression of HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, 
HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, or HRPCa 
5 19 (or various combinations thereof) . The levels of gene 
expression (i.e., a gene expression pattern) can be 
quantified by Northern blot analysis or RT-PCR, as 
described herein, or alternatively by measuring the 
amount of protein produced, by one of the methods as 

10 described herein, or by measuring the levels of activity. 
In this way, the gene expression pattern can serve as a 
marker, indicative of the physiological response of the 
cells to the agent. Accordingly, this response state may 
be determined before, and at various points during, 

15 treatment of the individual with the agent. 

In a preferred embodiment, the present invention 
provides a method for monitoring the effectiveness of 
treatment of a subject with an agent (e.g., an agonist, 
antagonist, peptidomimetic, protein, peptide, nucleic 

20 acid, small molecule, or other drug candidate identified 
by the screening assays described herein) comprising the 
steps of (i) obtaining a pre-administration sample from a 
subject prior to administration of the agent; (ii) 
detecting the level of expression or activity of HRPCa 

25 2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 
13, HRPCa 14, HRPCa 15, or HRPCa 19 the preadministration 
sample; (iii) obtaining one or more post-administration 
samples from the subject; (iv) detecting the level of 
expression or activity of HRPCa 2, HPRCa 3, HRPCa 6/7, 

30 HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, 
or HRPCa 19 in the post -administration samples; (v) 
comparing the level of expression or activity of HRPCa 
2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 
13, HRPCa 14, HRPCa 15, or HRPCa 19 in the pre- 

35 administration sample v/ith the level of expression or 
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activity of HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 
9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, or HRPCa 19 in 
the post-administration sample or samples; and (vi) 
altering the administration of the agent to the subject 
5 accordingly. For example, increased administration of 
the agent may be desirable to decrease the expression or 
activity of HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 
9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, or HRPCa 19 
lower levels than detected, i.e., to increase the 
10 effectiveness of the agent. 

C . Methods of Treatment 
The present invention provides for both 
prophylactic and therapeutic methods of treating a 
subject at risk of (or susceptible to) a disorder or 
15 having a disorder associated with aberrant HRPCa 2, 

HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, 
HRPCa 14, HRPCa 15, or HRPCa 19 expression or activity. 
Such disorders include prostate cancer, especially 
androgen-resistant prostate cancer. 

2 0 1 . Prophylactic Methods 

In one aspect, the invention provides a method for 
preventing in a subject, a disease or condition 
associated with an aberrant HRPCa 2, HPRCa 3, HRPCa 6/7, 
HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, 
25 or HRPCa 19 expression or activity, by administering to 
the subject an agent which modulates HRPCa 2, HPRCa 3, 
HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 
14, HRPCa 15, or HRPCa 19 expression or at least one 
HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, 

3 0 HRPCa 13, HRPCa 14, HRPCa 15, or HRPCa 19 activity. 

Subjects at risk for a disease (e.g., prostate cancer) 
which is caused by or contributed to by aberrant HRPCa 2 , 
HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, 
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HRPCa 14, HRPCa 15, or HRPCa 19 expression or activity- 
can be identified by, for example, any or a combination 
of diagnostic or prognostic assays as described herein. 
Administration of a prophylactic agent can occur prior to 
5 the manifestation of symptoms characteristic of the 

disease or disorder such that the disease or disorder is 
prevented or, alternatively, delayed in its progression. 

2 . Therapeutic Methods 

Another aspect of the invention pertains to 

10 methods of modulating HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 
8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, or 
HRPCa 19 expression or activity for therapeutic purposes. 
The modulatory method of the invention involves 
contacting a cell with an agent that modulates one or 

15 more of the activities of HRPCa 2, HPRCa 3, HRPCa 6/7, 
HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, 
or HRPCa 19. An agent that modulates HRPCa 2, HPRCa 3, 
HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 
14, HRPCa 15, or HRPCa 19 activity can be an agent as 

20 described herein, such as a nucleic acid or a protein, a 
naturally-occurring cognate ligand, a peptide, a- 
peptidomimetic, or other small molecule. Useful agents 
inhibit one or more of the biological activities HRPCa 
2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 

25 13, HRPCa 14, HRPCa 15, or HRPCa 19. Examples of such 
inhibitory agents include antisense nucleic acid 
molecules and antibodies. These modulatory methods can 
be performed in vitro (e.g., by culturing the cell with 
the agent) or, alternatively, in vivo (e.g, by 

3 0 administering the agent to a subject) . As such, the 
present invention provides methods of treating an 
individual afflicted with a disease or disorder 
characterized by aberrant expression or activity of 
HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, 
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HRPCa 13, HRPCa 14, HRPCa 15, or HRPCa 19. In one 
embodiment, the method involves administering an agent 
(e.g., an agent identified by a screening assay described 
herein), or combination of agents that modulates (e.g., 
5 downregulates) HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, 

HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, or HRPCa 
19 expression or activity. 

Agents which modulate (reduce) the expression or 
activity of HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 

10 9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, or HRPCa 19 
can be co-administered with an anti-androgenic compound. 
In addition, an agent which modulates (reduces) the 
expression or activity of HRPCa 2, HPRCa 3, HRPCa 6/7, 
HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, 

15 or HRPCa 19, optionally co- administered with an anti- 
androgenic compound, can be administered to a patient in 
conjunction with other anti-cancer therapies, e.g., other 
chemotherapies, radiation therapy, or anti- 
neovascularization therapy. 

20 VI ■ Prostate Cancer 

The expression level of HRPCa 2, HPRCa 3, HRPCa 
6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, 
HRPCa 15, or HRPCa 19 may be used to assess patients 
undergoing clinical evaluation for the treatment of 

25 prostate cancer. Thus, the genes of the invention can be 
used to select an appropriate therapy for an individual 
and to monitor an ongoing therapy. They may also be 
utilized as surrogate markers to monitor clinical human 
trials of a drug being tested for efficacy as a prostate 

3 0 cancer treatment. In either case, one or more of the 
genes of the invention may be utilized as a marker. 

One can determine whether a given patient will 
benefit from anti-androgenic therapy by examining the 
expression level of one or more of the genes of the 



wo 00/18961 



PCT/US99/22535 



- 77 - 

invention. If one or more of these genes is more highly 
expressed in the patient's prostate cancer cells in the 
presence of testosterone than in the absence of 
testosterone, the prostate cancer cells are more likely 
5 to respond to anti -androgenic therapy. As used herein, 
" ant i -androgenic therapy" refers broadly to any therapy 
which interferes with or blocks the synthesis or action 
of an androgen. Similarly, "anti- testosterone therapy" 
refers broadly to any therapy which interferes with or 

10 blocks the synthesis or action of testosterone. Such 

therapies can, for example, block ligand receptor binding 
(e.g., binding of testosterone to its receptor), receptor 
signaling (e.g., signal transduction mediated by the 
binding of testosterone to its receptor) , or the activity 

15 or expression of a downstream regulated gene. 

The biological function of HRPCa 2, HPRCa 3, 
HRPCa'6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 
14, HRPCa 15, and HRPCa 19 can be more directly assessed 
by utilizing relevant in vivo and in vitro systems. In 

20 vivo systems can include, but are not limited to, animal 
systems which naturally exhibit symptoms of prostate 
cancer or ones which have been engineered to exhibit such 
symptoms . 

The role of HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, 
25 HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, and 

HRPCa 19 can be determined by transfecting cDNAs encoding 
these gene products into appropriate cell lines, such as, 
for example, a prostate cancer cell line and analyzing 
the effect of the gene product on cell growth. 
30 In further characterizing the biological function 

of the genes of the invention, the expression of these 
genes can be modulated within the in vivo and/or in vitro 
systems, i.e., either over- expressed or under-expressed, 
and the subsequent effect on the system then assayed. 
35 Alternatively, the activity of the product of the 
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identified gene can be modulated by either increasing or 
decreasing the level of activity in the in vivo and/or in 
vitro system of interest, and assessing the effect of 
such modulation. 
5 The information obtained through such 

characterizations can suggest relevant methods for the 
treatment of prostate cancer. For example, treatment can 
include a modulation of gene expression and/or gene 
product activity. Characterization procedures such as 
10 those described herein can indicate where such modulation 
should involve an increase or a decrease in the 
expression or activity of the gene or gene product of 
interest . 

Described below are in vitro and in vivo systems 
15 which can be used to further characterize the genes of 
the invention. These system can also be used as part of 
screening strategies designed to identify compounds which 
are capable of preventing and/or ameliorating symptoms of 
prostate cancer. Thus, these systems can be used to 
20 identify drugs, pharmaceuticals, therapies and 

interventions which can be effective in treating prostate 
cancer and to determine the in vivo efficacy of drugs, 
pharmaceuticals, therapies and interventions. 

1 . In Vitro Systems 

25 Cells that contain and express a gene of the 

invention and exhibit cellular phenotypes associated with 
prostate cancer (e.g, androgen- independent prostate 
cancer) , can be utilized to identify compounds that 
exhibit an ability to prevent and/or treat prostate 

30 cancer. 

Further, pattern of expression of some or all of 
the genes of the invention in cells of interest can be 
analyzed and compared to the normal expression pattern. 
Those compounds which cause cells exhibiting cellular 
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phenotypes of prostate cancer to produce an expression 
pattern more closely resembling a normal expression 
pattern for the cell of interest can be considered 
candidates for further testing regarding an ability to 
5 ameliorate the symptoms of prostate cancer. 

Cells which will be utilized for such assays can, 
for example, include LNCaP cells. In addition, purified 
primary or secondary tumor cells derived from either 
transgenic' or non-transgenic tumor cells can be used. 

10 Further, cells which can be used for such assays 

can also include recombinant, transgenic cell lines. 
While primary cultures derived from the metastasis in 
transgenic animals can be utilized, the generation of 
continuous cell lines is preferred. For examples of 

15 techniques which can be used to derive a continuous cell 
line from a transgenic animal (Small et al . (1985) ftfol. 
Cell Biol. 5:642-648) . 

Alternatively, prostate cancer cells can be 
transfected with sequences capable of increasing or 

20 decreasing the expression of a gene of the invention 
within the cell. For example, a gene of the invention 
can be introduced into, and over expressed in, the genome 
of the cell of interest, or, if endogenous target gene 
sequences are present, they can either be overexpressed 

25 or, alternatively, be disrupted in order to underexpress 
or inactivate target gene expression. 

In order to overexpress a target gene sequence, 
the coding portion of the target gene sequence can be 
ligated to a regulatory sequence which is capable of 

30 driving gene expression in the cell type of interest. 
Such regulatory regions will be well known to those of 
skill in the art. 

For under expression of an endogenous target gene 
sequence, such a sequence can be isolated and engineered 

35 such that when reintroduced into the genome of the cell 
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type of interest, the endogenous target gene alleles will 
be inactivated. Preferably, the engineered target gene 
sequence is introduced via gene targeting such that the 
endogenous target sequence is disrupted upon integration 
5 of the engineered target gene sequence into the cell's 
genome . 

Transfection of target gene sequence nucleic acid 
can be accomplished by utilizing standard techniques 
(see, e.g., Ausubel, supra). Transfected cells should be 

10 evaluated for the presence of the recombinant target gene 
sequences, for expression and accumulation of target gene 
mRNA, and for the presence of recombinant target gene 
protein production. In instances wherein a decrease in 
target gene expression is desired, standard techniques 

15 can be used to demonstrate whether a decrease in 

endogenous target gene expression and/or in target gene 
product production is achieved. 

2 . In Vivo Systems 
In vivo systems of prostate cancer can be either 
20 non- recombinant animals or recombinantly engineered 

transgenic animals. Such models may be generated, for 
example, by introducing tumor cells into syngeneic mice 
using techniques such as implanting prostatic cancer 
cells into the prostate gland. After an appropriate 
25 period of time, the -tumors which result from these 
injections can be counted and analyzed. 

For the generation of animal models of prostate 
cancer, cells derived from, for example, a prostate 
cancer cell line may be implanted into the prostate of an 
3 0 animal and the resulting tumors may be analyzed and 
compared to, for example, normal tissue. 

The role of HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, 
HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, or HRPCa 
19 can be determined by trans feet ing cDNAs encoding such 
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the gene product into the appropriate cell line and 
analyzing its effect on the cells' ability to induce 
prostate cancer in animal models such as these. The role 
of the identified gene products may be further analyzed 
5 by, for example, culturing cells derived from the tumors 
which develop in the animal models, introducing these 
cultured cells into animals, and subsequently measuring 
the level of identified gene product present in the' 
resulting tumor cells. In this manner, cell line 

10 variants are developed which can be useful in analyzing 
the role of quantitative and/or qualitative differences 
in the expression of the identified genes on the cells' 
ability to induce prostate cancer. 

Recombinant animal models for prostate cancer can 

15 be engineered by utilizing, for example, target gene 

sequences such as those described herein in conjunction 
with techniques for producing transgenic animals that are 
well known to those of skill in the art. For example, 
target gene sequences can be introduced into, and 

20 overexpressed in, the genome of the animal of interest, 
or, if endogenous target gene sequences are present, they 
can either be overexpressed or, alternatively, can be 
disrupted in order to underexpress or inactivate target 
gene expression. 

25 In order to overexpress a gene of the invention, 

the coding portion of the gene sequence can be ligated to 
a regulatory sequence which is capable of driving gene 
expression in the animal and cell type of interest. Such 
regulatory regions will be well known to those of skill 

30 in the art. 

In order to obtain underexpress ion of an 
endogenous target gene sequence, such a sequence can be 
introduced into the genome of the animal of interest such 
that the endogenous target gene alleles will be 

35 inactivated. Preferably, an engineered sequence 
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comprising at least part of the target gene sequence is 
utilized and is introduced, via gene targeting, such that 
the endogenous target sequence is disrupted upon 
integration of the engineered target gene sequence into 
5 the animal's genome. 

Animals of any species, including, but not limited 
to, mice, rats, rabbits, guinea pigs, pigs, micro-pigs, 
goats, and non-human primates, e.g., baboons, monkeys, 
and chimpanzees can be used to generate animal models of 

10 prostate cancer. 

Any technique known in the art can be used to 
introduce a gene of the invention into animals to 
produce the founder lines of transgenic animals, as 
described above. 

15 Once transgenic animals have been generated, the 

expression of the recombinant target gene and protein can 
be assayed utilizing standard techniques. Initial 
screening can be accomplished by Southern blot analysis 
or PGR techniques to analyze animal tissues to assay 

20 whether integration of the transgene has taken place. 
The level of mRNA expression of the transgene in the 
tissues of the transgenic animals can also be assessed 
using techniques which include, but are not limited to. 
Northern blot analysis of tissue samples obtained from 

25 the animal, in situ hybridization analysis, and RT- 

coupled PGR. Samples of target gene -expressing tissue, 
can also be evaluated immunocytochemically using 
antibodies specific for the transgenic product of 
interest . 

30 The transgenic animals that express a recombinant 

nucleic acid or protein of the invention are further 
evaluated to identify those animals which display 
prostate cauicer characteristics. 

Additionally, specific cell types within the 

35 transgenic animals can be analyzed for cellular 



1 
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phenotypes characteristic of prostate cancer. Such 
cellular phenotypes can include, for example, 
differential gene expression characteristic of prostate 
cancer cells. Further, such cellular phenotypes can 
5 include as assessment of a particular cell type gene 

expression profile and its comparison to known expression 
profiles of the particular cell type in animals 
exhibiting prostate cancer. Such transgenic animals 
serve as suitable model systems for prostate cancer. 

10 Once target gene transgenic founder animals are 

produced (i.e., those animals which express a protein of 
the invention in cells or tissues of interest, and which, 
preferably, exhibit prostate cancer characteristics) , 
they can be bred, inbred, outbred, or crossbred to 

15 produce colonies of the particular animal. Examples of 
such breeding strategies include but are not limited to: 
outbreeding of founder animals with more than one 
integration site in order to establish separate lines; 
inbreeding of separate lines in order to produce compound 

20 transgenics that express the transgene of interest at 
higher levels because of the effects of additive 
expression of each transgene; crossing of heterozygous 
transgenic animals to produce animals homozygous for a 
given integration site in order to both augment 

25 expression and eliminate the possible need for screening 
of animals by DNA analysis; crossing of separate 
homozygous lines to produce compound heterozygous or 
homozygous lines; breeding animals to different inbred 
genetic backgrounds so as to examine effects of modifying 

30 alleles on expression of the target gene transgene and 

the development of symptoms of prostate cancer. One such 
approach is to cross the transgenic founder animals with 
a wild- type strain to produce an Fl generation that 
exhibits symptoms of prostate cancer. The Fl generation 

35 can then be inbred in order to develop a homozygous line, 
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if it is found that homozygous transgenic animals are 
viable. 

This invention is further illustrated by the 
following examples which should not be construed as 
5 limiting. The contents of all references, patents and 
published patent applications cited throughout this 
application are hereby incorporated by reference. 



EXAMPLES 



Example 1 : Identification of Genes that are 

10 Constitutively Expressed in Androgen- 

Independent Prostate Cancer Cells 
HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, 
HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, and were 
identified using a strategy designed to identify genes 

15 whose expression is induced by testosterone (or a similar 
androgen) in androgen- dependent prostate cancer cells and 
are constitutively expressed in androgen- independent 
prostate cancer cells. 

WT LNCaP cells (androgen-dependent prostate cancer 

20 cells) v/ere routinely grown in T162 flasks coated with 
Matrigel in RPMI-1640 medium supplemented with 10% FBS 
and 50 nM testosterone. 

To identify androgen regulated genes, LNCaP cells 
were first incubated in the absence of androgens and then 

25 treated with either testosterone or casodex. Ten T162 
flasks of LNCaP cells were pre-incubated for 24.5 ho\irs 
in dye-free RPMI-1640 containing 2% charcoal stripped 
serum. Following pretreatment, five T162 flasks of 
pretreated cells were treated with testosterone- 

30 containing medium (dye-free RPMI-1640, 2% CSS, 100 nM 
testosterone, 0.09% DMSO) , and five T162 flasks of 
pretreated cells were treated with casodex-containing 
medium (dye-free RPMI-1640, 2% CSS, 100 /xM casodex, 0.09% 
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DMSO) . After 25 hours of incubation in testosterone- 
containing medium or casodex-containing medium, the cells 
were detached from the flasks with trypsin and pelleted 

Total RNA was prepared from the cell pellets using 
5 the RNeasy protocol (Qiagen) . Approximately 260 fig of 
total RNA was obtained from each cell pellet. Next, 
polyA+ RNA was prepared form approximately 24 0 ng of each 
total RNA sample using the Oligotex protocol (Qiagen) , 
approximately 6 fig of polyA+ RNA was obtained from each 
10 24 0 fig total RNA sample, and 2 fig of each polyA+ RNA 
sample was used for the generation of subtraction 
libraries using the PCR-select protocol (Clontech; Palo 
Alto, CA) . 

The PCR products, representing partial cDNAs of 
15 putatively differentially expressed Minas, were subcloned 
into pCR2.1 (InVitrogen) and transformed into INValphaF^ 
cells . 

The cDNA inserts from individual clones of the 
subtraction libraries were PCR amplified and spotted onto 

20 nylon to generated high-density arrays. The arrays were 
probed with first strand cDNA from: WT LNCaP cells 
treated with 100 nM testosterone, WT LNCaP cells treated 
with 100 /iM casodex, LN3 LNCaP cells (an androgen 
independent variant of LNCaP cells) treated with 100 nM 

25 testosterone, LN3 LNCaP cells treated with 100 /iM 

casodex, WT LNCaP cells treated with 1 nM R1881, and WT 
LNCaP cells treated with stripped serum. Quantitative 
analysis of radioactive signal at each cDNA spot was 
performed to identify the relative differential 

30 expression of each cDNA among the above cell lines and 
drug treatments. The clones chosen for further analysis 
demonstrated higher expression levels in WT LNCaP cells 
treated with testosterone than in WT LNCaP cells treated 
with casodex and equal expression levels in LN3 cells 

35 treated with testosterone or casodex. 
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Among the clones selected for further analysis 
were HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 
10, HRPCa 13, HRPCa 14, HRPCa 15, and HRPCa 19. HRPCa 9 
and HRPCa 10 appear to be novel genes. A full-length 
5 cDNA sequence for each gene was determined. These 
sequences, shown in Figures 1- 9, as well as the known 
sequences of HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 

13, HRPCa 14, HRPCa 15, or HRPCa 19 can be used to 
produce the polypeptides and nucleic acids of the 

10 invention. 

Example 2 : Screening for Compounds Useful for the 
Treatment of Prostate Cancer 
Compounds potentially useful for the treatment of 
prostate cancer can be identified as follows. Prostate 

15 cancer cells (e.g., VJT LNCaP cells) are stably 

transfected with a vector capable of constitutively 
expressing HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 
9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, or HRPCa 
19 (e.g., a vector in which expression of HRPCa 2, HPRCa 

20 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 

14, HRPCa 15, or HRPCa 19 is under the control of the CMV 
IE promoter) . The transfected WT LNCaP cells are 
cultured under suitable conditions (e.g., in T162 flasks 
coated with Matrigel in RPMI-164 0 medium supplemented 

25 with 10% FBS and 50 nM testosterone) in the presence and 
absence of a test compound and the growth rate of the 
cells is measured. A compound which reduces the growth 
rate of the cells is a potential therapeutic compound for 
the treatment of prostate cancer. A potential 

30 therapeutic compound identified in this manner can be 

further analyzed by determining its effect on the growth 
rate of androgen- independent prostate cancer cells (e.g., 
LN3 LNCaP cells) . 



/ 



wo 00/18961 



PCT/US99/22535 



- 87 - 

The cDNA and protein sequences shown in the 
Figures and in publications or sequence databases 
referring to HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 
13, HRPCa 14, HRPCa 15, or HRPCa 19 provide those skilled 
in the art with the genes needed to prepared the 
transfected cell lines useful in for screening assays. 

Equivalents 

Those skilled in the art will recognize, or be 
able to ascertain using no more than routine 
experimentation, many equivalents to the specific 
embodiments of the invention described herein. Such 
equivalents are intended to be encompassed by the 
following claims. 

What is claimed is: 
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1. A method for identifying a compound useful 
for treating prostate cancer, comprising: 

a) measuring the expression level of HRPCa 2, 
HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, 

5 HRPCa 14, HRPCa 15, HRPCa 19, or peripheral -type 

benzodiazepine receptor (PBR) in a cell in the presence 
of a test compound; and 

b) comparing the expression measured in step a) to 
the expression of HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, 

10 HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, HRPCa 
19, or PBR in a cell in the absence of the compound, 

wherein the compound is useful for treating 
prostate cancer when the expression level of HRPCa 2, 
HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, 

15 HRPCa 14, HRPCa 15, HRPCa 19, or PBR in the presence of 
the test compound is less than its expression level in 
the absence of the test compound. 



2. The method of claim 1 wherein the expression 
level is determined by measuring the amount of HRPCa 2, 
20 HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, 
HRPCa 14, HRPCa 15, HRPCa 19, or PBR mRNA. 



3 . The method of claim 1 wherein the expression 
level is determined by measuring the amount of HRPCa 2, 
HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, 
25 HRPCa 14, HRPCa 15, HRPCa 19, or PBR protein. 



4 . The method of claim 1 wherein the cell is a 
prostate cancer cell. 

5. The method of claim 1 further comprising 
measuring the expression level of HRPCa 2, HPRCa 3, HRPCa 

30 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, 
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HRPCa 15, HRPCa 19, or PBR in the cell in the absence of 
the test compound. 

6. The method of claim 4 wherein the prostate 
cancer cell is an androgen independent prostate cancer 

5 cell. 

7 . The method of claim 4 wherein the prostate 
cancer cell constitutively expresses HRPCa 2, HPRCa 3, 
HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 
14, HRPCa 15, HRPCa 19, or PBR. 

10 8. A method for identifying a compound useful 

for treating prostate cancer, comprising 

a) measuring an activity of HRPCa 2, HPRCa 3, 
HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 
14, HRPCa 15, HRPCa 19, or peripheral -type benzodiazepine 

15 receptor (PBR) in the presence of the compound; and 

b) comparing the activity measured in step a) to 
the level of activity of HRPCa 2, HPRCa 3, HRPCa 6/7, 
HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, 
HRPCa 19, or PBR in the absence of the test compound, 

20 wherein the compound is useful for treating 

prostate cancer when the activity of HRPCa 2, HPRCa 3, 
HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 
14, HRPCa 15, HRPCa 19, or PBR in the presence of the 
test compound is lower than its activity in the absence 

25 of the test compound. 

9. The method of claim 8 wherein the activity is 
measured in a cell . 

10. The method of claim 9 wherein the cell is a 
prostate cancer cell. 
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11. The method of claim 10 wherein in the 
prostate cancer cell is an androgen- independent prostate 
cancer cell. 

12 . A method for identifying a compound useful 
5 for treating prostate cancer, comprising 

a) measuring the growth of a sample of cells 
which cons ti tut ively express HRPCa 2, HPRCa 3, HRPCa 6/7, 
HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, 
HRPCa 19, or peripheral -type benzodiazepine receptor 

10 (PBR) in the presence of the test compound; and 

b) comparing the growth measured in step a) to 
the growth of a sample of the cells in the absence of the 
test compound, 

wherein the compound is useful for treating 
15 prostate cancer when the growth of the sample of cells in 
the presence of the test compound is lower than the 
growth of the sample of cells in the absence of the test 
compound . 

13. The method of claim 12 wherein the cell 
20 sample comprises prostate cancer cells. 

14. The method of claim 13 wherein the prostate 
cancer cells are androgen- independent prostate cancer 
cells . 

15. The method of claim 13 wherein growth is 
25 measured in the presence of an androgen. 

16. A method of treating prostate cancer in a 
patient, comprising administering to the patient a 
compound which decreases the expression of HRPCa 2, HPRCa 
3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 
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14, HRPCa 15, HRPCa 19, or peripheral -type benzodiazepine 
receptor (PBR) . 

17. The method of claim 16 wherein the compound 
decreases expression of HRPCa 2, HPRCa 3, HRPCa 6/7, 

5 HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, 
HRPCa 19, or PBR mRNA. 

18. The method of claim 16 wherein the compound 
decreases expression of HRPCa 2, HPRCa 3, HRPCa 6/7, 
HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, 

10. HRPCa 19, or PBR protein. 

19. The method of claim 16 wherein the prostate 
cancer is androgen -dependent . 

20. The method of claim 16 wherein the prostate 
cancer is androgen- independent . 

15 21. A method of treating prostate cancer in a 

patient, comprising administering to the patient a 
compound which decreases an activity of HRPCa 2, HPRCa 3, 
HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 
14, HRPCa 15, HRPCa 19, or peripheral benzodiazepine 

2 0 receptor (PBR) . 

22 . The method of claim 21 wherein the prostate 
cancer is androgen -dependent . 

23 . The method of claim 22 wherein the prostate 
cancer is androgen- independent . 

25 24 . A method for diagnosing androgen- independent 

prostate cancer in a patient undergoing androgen 
withdrawal treatment, comprising: 
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a) providing a biological sample from the patient 
comprising prostate cells; 

b) measuring the expression of HRPCa 2, HPRCa 3, 
HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 

5 14, HRPCa 15, HRPCa 19, or peripheral -type benzodiazepine 
receptor (PBR) in the sample in the presence and absence 
of androgen; 

c) determining whether the expression of HRPCa 2, 
HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, 

10 HRPCa 14, HRPCa 15, HRPCa 19, or PBR is increased in the 
presence androgen compared to the absence of androgen, 
wherein the prostate cancer cells are androgen 
independent if the expression HRPCa 2, HPRCa 3, HRPCa 
6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, 

15 HRPCa 15, HRPCa 19, or PBR is not increased in the 

presence of androgen compared to the absence of androgen. 

25. A method for determining the efficacy of 
androgen withdrawal treatment in a prostate cancer 
patient, comprising: 

20 a) providing a first biological sample from the 

patient at a first time point and determining the 
expression level of HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, 
HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, HRPCa 
19, or peripheral benzodiazepine receptor (PBR) in the 

25 first sample; 

b) providing a second biological sample from the 
patient at a second time point after the first time 
point, the second time point occurring after the patient 
begins androgen withdrawal treatment, and determining the 

3 0 expression level of HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, 
HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, HRPCa 
19, or PBR in the second sample; 

c) comparing the expression level of HRPCa 2, 
HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, 
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HRPCa 14, HRPCa 15, HRPCa 19, or PBR in the first sample 
with that in the second sample, wherein an increase in 
the expression level in the second sample compared with 
the first sample indicates that the androgen withdrawal 
5 treatment has become less effective. 

26. The method of claim 16 further comprising co- 
administering an ant i- androgenic agent. 

27. A method for determining whether a patient is 
suffering from androge- independent prostate cancer, 

10 comprising: 

a) providing a sample of prostate cells from the 
patient; 

a) measuring the expression of one or more of 
HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, 

15 HRPCa 13, HRPCa 14, HRPCa 15, HRPCa 19, or peripheral 
benzodiazepine receptor (PBR) in the prostate cells in 
the presence and absence of an androgen; 

b) determining that the patient is suffering from 
androgen independent prostate cancer when the expression 

20 of one or more of HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, 
HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, HRPCa 
19, and PBR is not higher in the presence of the androgen 
than in the absence of the androgen. 

28. A method for determining whether a patient 
25 suffering from prostate cancer should be treated with an 

agent which decreases the expression of HRPCa 2, HPRCa 3, 
HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 
14, HRPCa 15, HRPCa 19, or peripheral benzodiazepine 
receptor (PBR) comprising: 
30 a) measuring the expression of one or more of 

HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, 
HRPCa 13, HRPCa 14, HRPCa 15, HRPCa 19, and PBR in the 
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prostate cells in the presence and absence of an 
androgen ; 

b) determining that the patient should be treated 
with an agent which decreases the expression of HRPCa 2 , 
5 HPRCa 3, HRPCa 6/7, HRPCa 8, HRPCa 9, HRPCa 10, HRPCa 13, 
HRPCa 14, HRPCa 15, HRPCa 19, or PBR when the expression 
of one or more of HRPCa 2, HPRCa 3, HRPCa 6/7, HRPCa 8, 
HRPCa 9, HRPCa 10, HRPCa 13, HRPCa 14, HRPCa 15, HRPCa 
19, and PBR is not higher in the presence of the androgen 
10 than in the absence of the androgen. 
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ggcggctcgg 
gtcgc-tgtc 
cgcgcctcag 
gaattccccc 
ttagaagctg 
gagaaacgag 
agcaaaacgg 
gcacaaatgg 
cggaagaaca 
agaactgact 
ttttactttt 
ccagatccag 
actctttttc 
act:tcagrac 
tarttgtgaa 
acrtttcaat 
aaaaccgcac 
cacagagccc 
cccaacgcac 
tgtctccttc 
ctttcagatg 
ttaaaagcac 
tggtttgctt 
tacgacacga 



actgagcagg 
ttctattcac 
gccaggcttt 
tttcccctcc 
cagaagaaag 
agcacgagaa 
cagaagagaa 
crgccaaact 
aagaatccaa 
ttctccccat 
tccctcctga 
actgtaagat 
taaagtgttg 
acttggtgta 
aagagtatgt 
cccaatrctg 
agtgctgttg 
ctcctggttg 
aaatgaccgc 
cacaatgact 
tttatttgca 
aagcgtgttt 
gctctacgtt 
gaatcagaat 



actttcctta 
catggcttct 
tgagctgatt 
aaagaagaag 
acgcaagccc 
agaagtgctt 
actgacccac 
ggaacgtctg 
agaccctgct 
ccccttccta 
caaacatctt 
gttgrtttag 
gtctrtctaa 
ctgggttaat 
agcggcttct 
tcccaatctt 
gtggcagtga 
atacttgttc 
acgttctctg 
gctttgtttg 
aacaaccatt 
ctagagaaca 
acaggtgggg 
attaataaaa 



tcccagttga 
tctgatatcc 
ctcagccctc 
gatctttccc 
catgaagctg 
cagaaggcaa 
aaaatggaag 
cgagagaagg 
gacgagactg 
aacatccaaa 
agaagcraat 
gggctaaagg 
tctagcratt 
gcctagtacc 
tttgaactgt 
accagargct 
crtcttttga 
cagatgcggc 
ccccgtctct 
gatgccrcag 
ttttgttctg 
gttgagagag 
catgtcctca 
crttatgtac 



ttgrgcagaa 
aggrgaaaga 
ggtcaaaaga 
tggaggaaat 
aggtcttgaa 
tagaagagaa 
ctaataaaga 
ataagcacat 
aagctgacta 
gactgtactg 
gtaggactgt 
ggagaaactg 
tttcttgttg 
ctattggcrc 
tagatgctga 
actggacttg 
gttaggttaa 
ctttggggct 
tgccccagtg 
cccaggtcag 
tgtccctttt 
aatctcaaga 
tcctttcctg 
tgctgtagc 



tacactgcct 
actggagaag 
acctgttcca 
tcagaagaaa 
gcagccggct 
caacaacttc 
gaaccgagag 
rgaagaagtg 
atttgttctg 
gccagtgtca 
acaggcagat 
aaagtgtttt 
catcttttct 
cgtgaaaaca 
atatctgttc 
aatggttaat 
raaatcaagc 
ggtagaaata 
tggtttgcat 
ctgttacttt 
aaaaggcaga 
rtctacttgg 
ccacaaaagc 
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cDNA 

TCGAGATCCATTGTGCTCTAAAGGCTCGCCCTCCTGTGCATCGCGGCTAATTTGGGGTAT 

CACTGAGCTGAAGACAAAGAGAAGGGGGAGAAAACCTAGCAGACCACCATGTGCTATGGG 

■\AGTGTGCACGAT GCATC GGACATTCTCTGGTGGGGCTCGCCCTCCTGTGCATCGCGGCT 

AATATITTGCTTTAC TTTC CCAATGGGGAAACAAAGTATGCCTCCGAAAACCACCTCAGC 

CGCTTCGTGTGGTTCTTTTCTGGeATCGTAGGAGGTGGCCTGCTGATGCTCCTGCCAGCA 

TTTGTCTTCATTGGGCTGG AACA GGATGACTGCTGTGGCTGCTGTGGCCATGAAAACTGT 

GGCAAACGATGTGCGATGCl 1 ICl ICTGTATTGGCTGCTCTCATTGGAATTGCAGGATCT 

GGCTACTGTGTCATTGTGGC AGCC CTTGGCTTAGCAGAAGGACCACTATGTCTTGATTCC 

CTCGGCCAGTGGAACTACACCnTGCCAGCACCGAGGGCCAGTACCTTCTGGATACCTCC 

ACATGGTCCGAGTGCACTGAACCCAAGCACATTGTGGAATGGAATGTATCTCTGTnrCT 

ATCCTCTTGGCTCTrGGTGG AATTGA ATTCATCTTGTGTCTrATTCAAGTAATAAATGGA 

GTGCTTGGAGGCATATGTGGCITTTGCTGCTCTCACCAACAGCAATATGACTGCTAAAAG 

AACCAACCCAGGACAGAGCCACAATCTTCCTCTATITCATTGTAATTTATATATrTCACT 

TGTATTCATTTGTAAAACTTrGTATTAGTGTAACATACTCCCCACAGTCTACTTTTACAA 

ACGCCTGTAAAGACTGGCATCirCACAGGATGTCAGTGTTTAAATTTAGTAAACTTCTTT 

TTTGTTTGTTTATTrGTGTAACATACTCCCCACAGTCTACTTTTACAAACGCCTGTAAAG 

ACTGGCATC TrCACAGGA TGTCAGTGTTTAAATTTAGTAAA CL ICJ 1 1 11 I G l 1 101 1 l A 

TTTGTTTTTGl 1 1 1 1 1 1 1 1 AAGGAATGAGGAAACAAACCACCCTCTGGGGGTAGITTACA 

GACTGAGTGACAGTACTCAGTATATCTGAGATAAACTCTATAATGrnTGGATAAAAATA 

ACATTCCATGGCACATATATACAATAGTGATTGGCTTTAGAGCACAAT 



AMINO ACID 

MCYGKCARCIGHSLVGLALLClA.'iNILLYFPNGETK.YASENHLSRPVWFFSGIVGGGLLM 
LLPAFVnGLEQDDCCGCCGHENCGKRCAMLSSVLAALIGlAGSGYCVrVAALGLAEGPL 
CLDSLGQWNYTFASTEGQYLLDTSTWSECTEPKHIVEWNVSLFSILLALGGIEFILCLIQ 
VINGVLGGICGFCCSHQQQYDC* 
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cDNA 

GTC3ACCCACGCGTCCGCGCQAGCTAAGCAGGAGGCGGAGGCGGAGGCGGAGGGCGAGGGGCGGGGAGCGCCGCCTGGAG 

CGCGGCAGGTCATATTGAACATTCCAGATACCTATCATTACTCGATGCTGTTGATAACAGCAAGATGGCTTTGAACTCAG 

GGTCACCACCy^GCTATTGGACCTTACTATGAAAACCATGGATACCAACCGGAAAACCCCTATCCCGCACAGCCCACTGTa 

GTCCCCACrGTCTACGAGGTGCATCCGGCTCAGTACTACCCGTCCCCCGTGCCCCAGTACGCCCCGAGGGTCCTGACGCA 

GGCrrCCAACCCCGTCGTCTGCACGCAGCCCAAATCCCCATCCGGGACAGTGTGCACCTCAAAGACTAAGAAAGCACTGT 

GCATCACCTTGACCCTGGGGACCTTCCTCGTGGGAGCTGCGCrGGCCGCTGGCCTACTCTGGAAGTTCATGGGCAGCAAG 

TGCTCCAACTCTGGGATAGAGTGCGACTCCTCAGGTACC7GCATCAACCCCTCTAACTGGTGTGATGGCGTGTCACACTG 

CCCCGGCGGGGAGGACGAGAATCGGTGTGrrCGCCTCTACGGATCAAACTTCATCCTTCAKTTGTACTCATCTCAGAGGA 

AGTCCTGGCACCCTGTGTGCCAAGACGACTGGAACGAGAACTACGGGCGGGCGGCCTGCAGGGACATGGGCTATAAGAAT 

AArrrrrACTCTAGCCAAGGAATAGTGGATGAttGCGGATCCACCAGCTTTATGAAACTGAACACAAGTGCCGGCAATGT 

CGATATCTATAAAAAACTGTACCACAGTGATGCCT G I I Zl i 'CAAAAGCAGTG C i i I ' C i ' lTA CGCTGTATAGCCTGCGGGG 

TCAACTTGAACTCAAGCCGCCAGAGCAGGATrGTGGGCGGCGAGAGCGCGCTCCCGGGGGCCTGGCCCTGGCAGGTCAGC 

CTGCACGTCCAGAACGTCCACGTGTGCGGAGCCTCCATCATCACCCCCGAGTXX3ATCGTGACAGCCGCCCACTGCGTGGA 

AAAACCTCTTAACAATCC ATGG CATTGGACGGCaTTTGCGGGGATTTTGAGACAATCTTTCATGTTCTATGGAGCCGGAT 

ACCAAGTAGAAAAAGTGATTTCTCATCCAAATTATGACrCCAAGACCAAGAACAATGACATTGCGCTGATGAAGCT 

AAGCC TCTG ACTTTCAACGACCTAGTGAAACCAGTGT3TCT3CCCAACCCAGGCATGATGCTGCAGCCAGAACAGCTCTG 

CTGGATTTCCGGGTGGGGGGCCACCGAGGAGAAAGGGAAGACCTCAGAAGTGCrGAACGCTGCCAAGGTGCTTCTCATTG 

AGACACAGAGATGCAACAGCAGATATGTCTATGACAACCTGATCACACCAGCCATGATCTGTGCCGGCTTCCTGCAGGGG 

AACGTCGATTCTTGCCAGGGTGACAGTGGAGGGCCTCTGGTCACTTCGAAGAACAATATCTGGTGGCTGATAGGGGATAC 

AAGCTGGGGTTCTGGCTGTGCCAAAGCTTACAGACCAGGAGTGTACGGGAATGTGATGGTATTCACGGACTGGATTTATC 

GACAAATGAGGGCAGACGGCr AAT CCACATGGTCTTCGTCCTTGACGTC Cjli ' i ' lA CAAGAAAACAATGGGGCTG Gi ' i ' lT G 

CTTCCCCGTGCATGATTTACTCTTAGAGATGATrCAGAGGTCACTrCATTTTTATTAAACAGTGAACTTGTCrGGCTTTG 

GCACTCTCTGCCATTCTGTGCAGGCTGCAGTGGCTCCCCTGCCCAGCCTGCTCTCCCTAACCCCTTGTCCGCAAGGGGTG 

ATGGCCGGCTGGrrGTGG GCAC TGGCGGTCAAGTGTGGAGGAGAGGGGTGGAGGCTGCCCCATTGAGATCTTCCTGCTGA 

GTCCTTTCCAGGGGCCAATTTTGGATGAGCATGGAGCTGTCACCTCTCAGCTGCTGGATGACrTGAGATGAAAAAGGAGA 

GACATGGAAAGGGAGACAGCCAGGTGGCACCTGCAGCGGCTGCCCTCTGGGGCCACTTGGTAGTGTCCCCAGCCTACCTC 

TCCACAAGGGGATTITGCTGATGGGTTCTTAGAGCCTTAGCAGCCCTGGATGGTGGCCAGAAATAAAGGGACCAGCCCTT 

CATGGGTGGTGACGTGGTAGTCACl'a'UTAACGGGAACAGAAACA iniimTCiTA TGGGGTGAGAATATAGACACrro 

CrrGGTGCGAGGGAAGCAATTGAAAAGGAACTTGCCCTGAGCACTCCTGGTGCAGGTCTCCACCTGCACATTGGGTGGG^ 

CTCCTGGGAGGGAGACTCAGCCTTCCTCCTCATCCTCCCTGACCCTGCTCCTAGCACCCTGGACAGTGCACATGCCCCTT 

GGTCCTGGCAGGGCGCCAAGTCTGGCACCATGTTGGCCTCTTCAGGCCTGCTAGTCACTGGAAATTGAGGTCCATGGGGG 

AAATCAAGGATGCTCAGTTTAAGGTACACrGTTTCCATGTTATGTTrCTACACATTGCTACCTCAGTGCTCCTGGAAACT 

TAGC7T T TGATGTCr CCAAGTAGTCCACCTTCATTTAACTCTrTGAAACTGTATCATCTTTGCCAAGTAAGAGTGGTGGC 

CTATTTCAGCTGCrTTGACAAAATG ACTGG C TCCT GACTTAACGTTCTATAAATGAATGTGCTGAAGCAAAGTGCCCATG 

GTGGCGGCGAAGAAGAGAAAGATUiUirrTGTTTTGGACrCTCTGTGGTCCCTTCCAATGCTGTGG uriT CCAACCAGGG 

GAAGGGTCCCTTTTGCATTGCCAAGTGCCATAACCATGAGCACrACTCTACCATGGTTCTGCCTCCTGGCCAAGCAGGCT 

GGTTTGCAAGAATGAAATGAATGATTCIACAGCTAGGACrTAACCTrGAAATGGAAAGTCTTGCAATCCCATTTGCAGGA 

TCCGTCTGTGCACATGCCTCTGTAGAGAGCAGC»TTCCCAGGGACCTTGGAAACAGTTGGCACTGTAAGGTGCTTGCTCC 

CCAAGACACATCCTAAAA GGTGTTG TAATGGTGAAAACGTCTTCCTTCTrTATTGCCCCTTC'rrATTTATGrGAACAACT 

GTTTGTCTTrTTTTGTATCTTTTTTAAACTGTAAAGTTCAATTGTGAAAATGAATATCATGCAAAT;^ 

TTTTTTCAAAGTAAAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGC 

AMINO ACID 

MAUJSGSPPAIGPyir=:>mGYQPEllPYPAQ?TVVPTVYEVHPAQYYT>SPVPQYA?RVLTQA 
SNPWCTQPKSPSGTVCTSKTKKALCITLTLGTFLVGAAIAAGLLMKFMGSKCSNSGIEC 
DSSGTCINPSNWCDGVSHCFGGEDENRCVRLYGSNFILQVYSSQRKSWHPVCQDDWJIENY 
GRAACRDMGYKMNr^SSQGIVDDSGSTSFMKLNTSAGNVDIYKKLYHSDACSSKAWSLR 
C lACGVNLWSSRQSRIVGGESALPGAWPWQVSLHVQNVHVCGGS IITPEWIVTAAHCVEK 
PLimPKKW7ArAGIIJlOSETir!fGAGYQVEmSHP>rn)SKTmnjIALMKLQKPLTETnDL 
VKPVCLP^fPGMMI^?SQLCWISGWGATEEKGKTS^:VLNAAiCVLLIE7QRCNSRYVYDNLI 
TPAMICAGFLQGNVDSCQGDSGGPLVTSKNNIWWLIGDTSWGSGCAKAYRPGVYGNVMVF 
TDWIYROMRADG* 
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cDNA 

GTCGACCCACGCGTCCGGCGTAGTGCGCTCCCCGCCCAGCCTGCAGAGCTCGCGCCGCGGCAG 

CCCAGCCGCTCGGCCCCGCCGCGCTCGCAGAGGCCGCCATGGGCACCGCGCGCTGGCTCGCGC 

TGGGCAGCCTCTrCGCCCTGGCTGGGCTGCTGGAAGGCCGGCTCGTGGGCGAGGAGGAAGCCG 

GCTTTGGCGAATGTGACAAGTTCTTCTACGCCGGGACCCCGCCTGCGGGGCTGGCGGCCGATT 

CCCACGTGAAGATCTGTCAGCGCGCGGAGGGTGCTGAGCGCrTCGCCACCCrCTACAGCACCC 

GGGACCGCATCCCCGTGTACTCCGCGTrCCGCGCCCCGCGCCCTGCGCCCGGCGGCGCCGAGC 

AGCGATGGCTGGTGGAGCCGCAGATCGATGACCCCAACAGCAACCITGAGGAGGCGATTAAT 

GAGGCAGAGGCCATCACCTCrGTGAACAGCCTGGGAAGCAAGCAAGCCnTGAATACAGATTA 

CCTTGATTCTGATrACCAAAGAGGACAGCnTACCCATrCTCCCTTAGCAGTGATGTCCAGGTG 

GCCACAnTACTCTCACAAATrCAGCCCCAATGACTCAGTCCrrCCAGGAACGGTGGTATGTG 

AATCTCCACAGCCTAATGGACCGGGCnTGACCCCACAGTGTGGCAGTGGGGAAGACCrATA 

TATCCTCACAGGCACAGTGCCCTCAGACTACAGAGTTAAAGACAAAGTGGCAGTCCCTGAGTr 

TGTTTGGCTGGCAGCCrGTTGTGCrGTCCCTGGAGGAGGCrGGGCCATGGGCnTGTCAAGCA 

CACCCGGGACAGTGACATCATAGAAGATGTGATGGTAAAAGATCTTCAGAAACTGCTrCCATT 

TAACCCTCAGCTGTTTCAGAACAACTCTGGTGAAACTGAGCAAGACACAGAGAAAATGAAAA 

AAATCCTGGAAGTGGTTAACCAAATCCAGGATGAAGAACGAATGGTACAATCTCAAAAGAGT 

TCTAGTCCCCTT TCTAGC ACCAGGAGCAAGAGGTCTACTCTGTrGCCTCCAGAGGCATCTGAG 

GGAAGTAGTAGCTnTTGGGAAAACTCATGGGCn-CATTGCTACCCCATrCATCAAGCTrnTC 

AATTAATITArrACCTTGTGGTAGCAATCCTGAAGAACA-rTGTCTATTrCCTGTGGTGTGTrAC 

CAAGCAGGTGATTAATGGCATAGAAAGTTGCCnTACCGCCTGGGCTCAGCCACCATCTCATA 

CTTCATGGCCATTGGGGAAGAGTTGGTGAGCATTCCCTGGAAGGTGCTCAAGGTCGTGGCCAA 

AGTCATCAGGGCTCTCCTCCGGATCCTTTGTrGTCTGCTGAAGGCCATn-GCCGAGTrCTGAGC 

ATCCCTGTCCGTGTCCrrGTGGATGTGGCCACTTTCCCTGTGTACACCATGGTCGCTATTCCAA 

TTGTTTGCAAGGACATTGCACTGGGCCrrGGTGGCACTGTCTCACTGCTCTTrGACACTGCTTT 

T GGTAC CCTGGGTGGCCTATTrCAGGTGGTTTTTAGTGTCTGCAAGCGGATTGGCTACAAGGTT 

ACTITTGACAATrC TGGGG A GTTA TAAACTCAAAAAACTAATAGTATCCAGTCACAGTGAATr 

TGAAAGCTGGAATAGTTTGTCTT TACAAT GGGTITCrGTrCACTGTCVGTrATCATTATATTTTG 

GCCTTTGGTGGGGATGTCTGCTTGi 1 1 1 1 GCAAAAGAAGATGGCAGAATTTAGACrTGACAGA 

GGAGAAATGCTCAGGGTGAGATTAGGTGTAGTAATCTGCTGTITACCTCCAGTrATATGTGCA 

AACTCCCAAGCCACTAATAACTTCAGTTATGCACTCTAACACAGACGACCACCTGAAATGCAC 

TGGTATTTATTTCTGATAATTAAAAATTACAGGGGAGGGAAGAACTAGAAAAAGAACAACTTT 

AGACCAAAGGTGTCTGAGAAAAGGAGAAAGGGAGCf 101 ICriCCCATTGCTCTTTGTGATTT 

AAGGCAAAACAGATTAAAAAAAAAATCTGCAGCCAATTTCTTGGCATrrGCTTCTCTITCTCC^ 

CAACTC GACT GACCTTGGTGGA ATGC AGATAATGCCTCTTTGTTGAAATAACnTATGGGAAT 

GTACAArmC TATAT CTCrr AGCTTTCCGT GGTrCTCAAGGATATGTACAGTmCATTTTCCT 

CCAAAGTTGAATTTTG CTACA i 1 1 1 i Ul l ri'AGGTAATGATAAGCATTTTTTAAAAAATCATnT 

TA GGTAAIGGTAAGCAT nTATGCCAAATGrGGCATAACAGAGTITGAATTGAAGGGCAAAGT 

I I n,i I nui 1 1 1 1 1 1 1 1 iGGCCCCTTGAATGGTATAATACAGTCCTCTCCGGTGCAAAGAAGAG 

aagagaaggtggacagccctgcrcttagtaggtgctgcagatccaaggacatcmtgtccag 

cttggattaacttgacgtgtatccctgcctgacaaggtrgaactgaaagatctatatgttaagc 

taacatgaaaattcatattctgcaacatagtagattitrctaatgcatgaaataagtacccag 

cacagtaaaaatactctgacttatgtccctaaatggttgttttgatacaActatatagaaaag 

agccacaaaataaagataaaagt rattg tggccatctctgaaaaaaatatataaaatatttaa 

gaataattatatcrraggaaatratmtacagtgtgttrgaggctacaaacataactccccca 

ttaatacaaatraaatgtgagagctcattctcaaaaritititgatcaagcactrgtcatm^ 

aatcttgca ctaaa aaatgg taac aagagggaccaaacntgcttcccacaattgggatggaa 

tcacctggatttttctgaatgrntaaagaattgctgagaggtagaaacagccaagtatgaaa 

tactgatcttggggctaccgcccaggatcaatcagaaagttatatgcaaaaaitcggggtccc 

aacaaggagaga acaat atgtcaaccatttgcatggggtcgatggatgagaaacatgagcgt 

agcaagagttacatntgcagaaaaatagcggaggcacacccagagtagaacagcagccagc 
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AMINO ACID 

MGTARWUaGSlJALAGLLEGRLVGEEEAGFGECDKFFYAGTPPAGLAADSHVKICQRAE 

GAERFATLYSTRDRPVnfSAFRAPRPAPGGAEQRWLVEPQIDDPNSNLEEAINEAEAITS 

VNSLGSKQALNTDYLDSDYQRGQLYPFSLSSDVQVATFILTNSAPMTQSFQERWYVNLHS 

LMDRALTPQCGSGEDLmTGTVPSDYRVKDKVAVPEFVnviJVACCAWGGGWAMGFVKHT 

RDSDIIEDVMVKDLQKlXPFhnKJLFQWCGETEQDTEKMKKII^VVNQIQDEERmQSQK 

SSSPI^STOSKRSmPPEASEGSSSFLGKmCFUTPFIKIJQLIYYLVVAlLKNIVYF 

LWCVTKQVmGmSCLYRLGSATISYFMAIGEELVSIPWKVUiWAKVIRALLRILCCLL 

KAICmVLSIPVRVLVDVATFPVYTMVAIPIVCKDIALGLGGTVSLLFDTAFGTLGGLFQV 

VFSVCKWGYKVTFDNSGEL* 

FIG. 4B 
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cDNA 

TCGACCCACGCGTCCGGCGTAGTGCGCTCCCCGCCCAGCCTGCAGAGCTCGCGCCGCGGCAGC 

CCAGCCGCTCGGCCCCGCCGCGCTCGCAGAGGCCGCCATGGGCACCGCGCGCTGGCTCGCGC 

TGGGCAGCCTCTTCGCCCTGGCTGGGCTGCTGGAAGGCCGGCTCGTGGGCGAGGAGGAAGCC 

GGCTTTGGCGAATGTGACAAGTTCTTCTACGCCGGGACCCCGCCTGCGGGGCTGGCGGCCGAT 

TCCCACGTGAAGATCTGTCAGCGCGCGGAGGGTGCTGAGCGCTTCGCCACCCTCTACAGCACC 

CGGGACCGCATCCCCGTGTACTCCGCGTrCCGCGCCCCGCGCCCTGCGCCCGGCGGCGCCGAG 

CAGCGATGGCTGGTGGAGCCGCAGATCGATGACCCCAACAGCAACCTTGAGGAGGCGATTAA 

TGAGGCAGAGGCCATCACCTCTGTGAACAGCCTGGGAAGCAAGCAAGCCTTGAATACAGATT 

ACCTTGATTCTGATTACCAAAGAGGACAGCnTACCCATTCTCCCTTAGCAGTGATGTCCAGGT 

GGCCACATTTACTCTCACAAATTCAGCCCCAATGACTCAGTCCTTCCAGGAACGGTGGTATGT 

GAATCTCCACAGCCTAATGGACCGGGCTTTGACCCCACAGTGTGGCAGTGGGGAAGACCTATA 

TATCCTCACAGGCACAGTGCCCTCAGACTACAGAGTrAAAGACAAAGTGGCAGTCCCTGAGTT 

TGTTTGGCTGGCAGCCTGTTGTGCTGTCCCTGGAGGAGGCTGGGCCATGGGCTTTGTCAAGCA 

CACCCGGGACAGTGACATCATAGAAGATGTGATGGTAAAAGATCTTCAGAAACTGCTTCCATT 

TAACCCTCAGCTGTTTCAGAACAACTGTGGTGAAACTGAGCAAGACACAGAGAAAATGAAAA 

AAATCCTGGAAGTGGTTAACCAAATCCAGGATGAAGAACGAATGGTACAATCTCAAAAGAGT 

TCTAGTCCCCTTTCTAGCACCAGGAGCAAGAGGTCTACTCTGrrGCCTCCAGAGGCATCTGAG 

GGAAGTAGTAGCl l l I IGGGAAAACTCATGGGCTTCATTGCTACCCCATTCATCAAGCrri'nC 

AATTAATTTATTACCTTGTGGTAGCAATCCTGAAGAACATTGTCTATTTCCTGTGGTGTGTTAC 

CAAGCAGGTGATTAATGGCATAGAAAGTTGCCTTTACCGCCTGGGCTCAGCCACCATCTCATA 

CTTCATGGCCATTGGGGAAGAGTTGGTGAGCATTCCCTGGAAGGTGCTCAAGGTCGTGGCCAA 

AGTCATCAGGGCTCTCCTCCGGATCCmGTTGTCTGCTGAAGGCCATrTGCCGAGTTCTGAGC 

ATCCCTGTCCGTGTCCTTGTGGATGTGGCCACTTTCCCTGTGTACACCATGGTCGCTATTCCAA 

TTGTTrGCAAGGACATTGCACTGGGCCTTGGTGGCACTGTCTCACTGCTCTTTGACACTGCnr 

TGGTACCCTGGGTGGCCTATTTCAGGTGUl rrriAGTGTCTGCAAGCGGATTGGCTACAAGGTT 

ACi'Ill GACAATTCTGGGGAGTTATAAACrCAAAAAACTAATAGTATCCAGTCACAGTGAATT 

TGAAAGCTGGAATAGTTTGTCTTTACAATGGGTTTCTGrrCACTGTCAGTTATCATrATATTTT 

GGCCTTTGGTGGGGATGTCTGCTTGTTTrTGCAAAAGAAGATGGCAGAATTTAGACTTGACAG 

AGGAGAAATGCTCAGGGTGAGATTAGGTGTAGTAATCTGCTGTTTACCTCCAGTTATATGTGC 

AAACTCCCAAGCCACTAATAACTTCAGTTATGCACTCTAACACAGACGACCACCTGAAATGCA 

CTGGTATTrATTTCTGATAATTAAAAATTACAGGGGAGGGAAGAACTAGAAAAAGAACAACT 

TTAGACCAAAGGTGTCTGAGAAAAGGAGAAAGGGAGCTTCTTCTTCCCATTGCTCTTTGTGAT 

TTAAGGCAAAACAGATTAAAAAAAAAATCTGCAGCCAA 1 ' 1 1 C 1 ' 1 GGCATTTGCTTCTCTTTCTC 

CTC.\ACTCG ACTG ACCTTGGTGG AATGCAGATAATGCCTC HIGH GAAATAACTTTATGGG A 

ATGTACAATTTTCTATATCTCTTAGCTTTCCGTGGTTCTCAAGGATATGTACAGTTTTCATnTC 

CTCCAAAGTTGAATnTGCTACAl'1'lTlfrn'l'AGGTAATGATAAGCAiTl 1 1 lAAAAAATCAT 

TTTTAGGTAATGGTAAGCATTTTATGCCAAATGTGGCATAACAGAGTTTGAATTGAAGGGCAA 

AGTmcrriicrri 1 1 1 1 1 1 iggccccttgaatggtataatacagtcctctccggtggaaaga 

AGAGAAGAGAAGGTGGACAGCCCTGCTCTTAGTAGGTGCTGCAGATCCAAGGACATCl 1 1 TGT 

CCAGCTTGGATTAACTTGACGTGTATCCCTGCCTGACAAGGrrGAACTGAAAGATCTATATGT 

TAAGCTAACATGAAAATTCATATTCTGCAACATAGTAOA lllTl CTAATGCATGAAATAAGTA 

CCCAGCACAGTAAAAATACTCTGACTTATGTCCCTAAATGGTTGTTTTGATACAACTATATAG 

AAAAGAGCCACAAAATAAAGATAAAAGTTATTGTGGCCATCTCTGAAAAAAATATATAAAAT 

AnTAAGAATAATTATATCTTAGGAAArrA 1 1' I'll ACAGTGTGTTTGAGGCTACAAACATAACT 

CCCCCATTAATACAAATTAAATGTGAGAGCTCATTCTCAAAATTTTTTTGATCAAGCACTTGTC 

ATTTTAAATCrTGCACTAAAAAATGGTAACAAGAGGGACCAAA(nTTGCTTCCCACAA"rTGGG 

ATGGAATCACCTGGATrmCTGAATGTTTTAAAGAATTGCTGAGAGGTAGAAACAGCCAAGT 

ATGAAATACTGATCTTGGGGCTACCGCCCAGGATCAATCAGAAAGTTATATGCAAAAATTCGG 

GGTCCCAACAAGGAGAGAACAATATGTCAACCATTTGCATGGGGTCGATGGATGAGAAACAT 

GAGCGTAGCAAGAGTTACATnTGCAGAAAAATAGCGGAGGCACACCCAGAGTAGAACAGCA 

GCCAGCAAGCTGCCATCCTGATCAATTTGTGATGCAAGGTTAGGGAGTATGAGCACCGCATGG 

GTCCATGCTAGGGAGATGTGCACCAGGCTTAGCTAAGAAACITAAAGCAGTATTTTACCAACT 

CTTGTCTTAGGGAGCATAAAGTTTGGATGTCCCTITATTTCAGCAGTGTGAAGGTAAATGGAA 
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GGGTGAGGGTrrCACnTGGTACTGATrGGTCAAGAATCCTGCrGGATAAAGAAAAGGAArnT 

AGAAGTGACCAGAGGGACAGTCCAGCCAAACTATTATTTGATAAGGAACCCAAGGCCCTGGG 

AGGCAACAGGCTAGCCCTTAGTTTCATGGCTAGCTGAGAGCAGATrAGGAGCCAACGTTGTTT 

GCACATGTCCCATCACACCTGAGATGTCAGACATGGGAAGTTCGTGCTAITAITCAGTrGCCT 

CTCTGGACCATGGCAAGA-nTCCTCATTCATCAAACAGACTCCAGGCCCTGACAAAGCAG-rTG 

GATTTGGCATGTGTGATGAAAACAATTAGCCATCCATGTACAACArrATGCITACTGCATCCC 

ATGGAAACTTAATTCCCTCCTAGCACATATrrGCATACTGAAAGGTCCGAAAAGGGCATCCAC 

TTGGGGAATGT^i i icm m iCACATCCAATTGTTTAGTGTCTATTTArrcrrGGGTGGCCAGTr 
II°^^5^'^^:^^^^°^^^'^'^CCAAAAAGT^^ 

CTGCCCTGAGCTACrAGTGTGGCTGTGCCTGTGGGTCTCTAGAACCATCTGCATTGGACG^^^^^ 
AGCCACAGCAGGTGGCTGGACTGCTGGCCTGTrCCTAATGAGCTACGCTCGG<nTrGAGGTAG 
AGGTTGGGGTTTATGAACCCCAACTCTGGCTrAAAGATCTTGCTGTGGCTCTGTTATGTrCTGA 
GGCCTTGGGATTAGCCTCTTCCTCATTATGGAGCTGATnTCT^ 

TTTGGACACITCTCTTTTCCArrGTGCCTTrTGAATGTrGTCTTCTCACn-CAGCATC^ 

GATCTAAATGCAGACTAGGTAGTTGGGAGGAGGAACCAAAGTGAACCATCCTTCATrTATTCA 

GTCATTCGTrCATCTGTCAAACACGTATTTGGACATCAAGGTTGCAGAGATGAACAATGCATG 

GATTTCATCnTGAGGAGTTCAAAACCTAGTGGAGAGAACACATGGTACAATCGTAACACATG 

AAGGACAAGTAAGTGCTGCAGTAAAGGTACTAATAACATGTTCCTTGGAACAGAGGAAGAAA 

AACCACGAAACCATGGAAATTAGGGAAGCCTTrACAGAGGGTGTGACAAAACTCAA-nTGAC 

ATTTTCAAGCTATGTACAATGATGTGCACCTTGCAGATGCTCAATAAAGTAATTACTGACAAA 
AAAAAAAAAAAAAAAAAGGGCGGCCCC 

AMINO ACID 

MGTARWLALGSLFALAGLLECRLVGEEEAGFGECDKFFYAGTPPAGLAADSHVKICQRAE 

GAERFATLYSTODRIPVYSAfRAPRPAPGGAEQRWLVEPQIDDPNSNLEEAINEAEArrS 

VNSLGSKQALNTDYLDSDYQRGQLYPFSLSSDVQVATFTLTOSAPMTQSFQERWYVNLHS 

LMDRALTPQCGSGEDLYILTGTVPSDYRVKDKVAVPEFVWLAACCAVPGGGWAMGFVKHT 

RDSDIIEDVMVKDLQKLLPFNPQLFQNN'CGETEQDTEKMmLEVVNQIQDEERMVQSOK 

SSSPLSSTRSKRSTLLPPEASEGSSSFLGKLMGFIATPFKLFQLIYYLWAILKl'JIVYF 

LWCVTKQVINGIESCLYRLGSATISYFMAIGEELVSI?WKVLKWAKVIRALLRn.CCLL 

KAICRVXSIPVRVLVDVATFPVYTMVAIPIVCKDIALGLGGTVSLLFDTAFGTLGGLFOV 
VFSVCKJUGYKVTFDNSGEL* 

FIG.5B 
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AMINO ACID 

MSQAEFEKAAEE\TUn,KTKPSDEEMLFIYGHYKQATVGDI>n^RPGMLDFTGKAKWDAWN 
ELKGTSKEDAMKAYINKVEELKKKYGI' 



cDNA 

GGrrCCTGGTCCTCGCCTCCTCCGCTGTCTCCCTGGAGTTCTTGCAAGTCGGCCAGGATGTCTC 

AGGCTGAGnTGAGAAAGCTGCAGAGGAGGTTAGGCACCTTAAGACCAAGCCATCGGATGAG 

GAGATGCTGTTCATCTATGGCCACrACAAACAAGCAACrGTGGGCGACATAAATACAGAACG 

GCCCGGGATGTTGGACTTCACGGGCAAGGCCAAGTGGGATGCCTGGAATGAGCTGAAAGGGA 

CTTCCAAGGAAGATGCCATGAAAGCTTACATCAACAAAGTAGAAGAGCTAAAGAAAAAATAC 

G GGATATGA GAGACTGGATTTGGTTACTGTGCCATGTGTTTATCCTAAACTGAGACAATGCCT 

TGTmTTTCTAATACCGTGGATGGTGGAA 
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CGGCCGCGTCCTCAAGCCGGCACCTGAGCGGCGGAGACGGCTGTAGCACAAGGATCTGCA 

TCTCCAATGGATACTGAGGGGTrrGGTGAGCrCCTrCAGCAAGCrGAACAGCTrcCTGCT 

GAGACTGAGGGCATCTCAGAGCTrCCCCATGTGGAACGGAACTTACAGGAGATCCAGCAG 

GCGGGAGAGCGCCTGCGTTCCCGTACCCTAACACGCACGTCCCAGGAGACGGCAGATGTC 

AAGGCGTCAGTrCTCCTCGGGTCTCGGGGACTTGACATATCCCACATCTCCCAGCGATTG 

GAGAGTCTGAGTGCAGCCACCACCnTGAGCCTCTTGAGCCTGTGAAGGACACTGACATT 

CAGGGCrrCCTGAAGAATGAGAAGGACAATGCCCTGCTGTCTGCCATCGAAGAGTCCCGG 

AAGAGGACCrrCGGCATGGCTGAGGAGTACCATCGGGAGTCAATGTTGGTTGAGTGGGAG 

CAAGTGAAACAGCGAATTCTGCACACACTGCTGGCATCAGGAGAAGACGCCCTTGACnT 

ACTCAAGAAAGCGAGCCAAGCTACATCAGTGATGTGGGACCCCCTGGTCGAAGCTCTCTG 

GATAACATCGAGATGGCCTATGCGCGGCAAAnTATATCTATAATGAGAAAATTGTAAAT 

GGACACCrGCAGCCTAACCTGGTGGACCnTGTGCTTCCGTCGCAGAGCTGGATGATAAG 

AGCAnTCCGACATGTG'GACCATGGTAAAACAAATGACAGACGTGTTGTTGACACCGGCA 

ACGGATGCCCTGAAGAACCGCAGCAGCGTGGAAGTGCGCATGGAGTITGTCAGGCAGGCC 

TTCGCGTACCTrGAGCAGAGTTATAAGAATTACACCCTrGTGACTGTCTITGGAAATTTG 

CATCAGGCCCAGCTGGGCGGGGTGCCTGGGACTTACCAATrGGTTCGAAGnTCCTGAAC 

ATTAAACTGCCAGCTCCCTTGCCTGGACTACAGGATGGAGAGGTGGAAGGCCATCCTGTG 

TGGGCGCTAATTrACTACTGCATGCGCTGTGGAGACCTGCTrGCCGCITCACAGGTAGTr 

AATCGAGCCCAGCACCAGCTGGGAGAGnTAAAACCTGGTTCCAGGAGTACATGAACAGC 

AAGGACAGAAGATTGTCCCCAGCTACGGAAAACAAGCTCCGGCTGCATTACCGTAGGGCC 

CTCAGGAACAATACAGATCCCTACAAGCGGGCCGTGTACTGTATCATTGGCAGATGTGAC 

GTCACCGACAACCAGAGTGAAGTGGCGGACAAAACTGAGGATTACCTGTGGCTGAAGTTG 

AACCAAGTGTGriTTGACGACGATGGCACCAGCTCCCCACAAGACAGGCTCACrCTCTCA 

CAGTTCCAGAAGCAGTTGTTGGAAGACTATGGCGAGTCCCACTTTACGGTGAACCAGCAA 

CCCTT CCTCTACITCCAAGTCCTGTrCCTGACAGCGCAGTTrGAAGCAGCAGTrGCCnT 

CI 11 iCCGC ATGGA GCGGCTGCGCTGCCATGCTGTCCATGTAGCACrGGTGCrGnTGAG 

CTGAAGCTGCTTTTAAAGTCCTCTGGACAGAGTGCTCAGCTCCTCAGCCACGAGCCTGGT 

GACCCTCCTTGCTrGCGGCGGCTGAACTTCGTGCGGCTCCTCATGCrGTACACCCGGAAG 

TTTGAGTCCACGGACCCAAGGGAGGCCCTCCAGTACTTCTATrrCCTCAGGGATGAGAAA 

GATAGTCAAGGAGAAAACATGTTTCTGCGCTGTGTGAGTGAGCTTGTGATTGAAAGCCGA 

GAGTrCGATATGATTCTTGGGAAACTAGAGAATGACGGAAGTAGAAAGCCTGGAGTCATA 

GATAAGnTACTAGTGACACAAAGCCTATTATCAACAAAGTrGCTTCTGTGGCAGAAAAT 

AAAGGACTGTTTGAAGAGGCV^GCAAAGCTGTATGACCTTGCCAAGAATGCTGACAAGGTA 

CTGGAGCTGATGAACAAACTGCTGAGCCCTGTCGTCCCCCAGATCAGTGCCCCGCAATCC 

AACAAGGAGAGGCTGAAGAACATGGCACTCTCCATrGCCGAACGGTATAGGGCTCAAGGA 

ATAAGCGCAAATAAATTTGTGGACTCCACGTrCTAT Ll ICl 11 I GGACTTGATCACCTTt 

TITGACGAGTATCATAGTGGTCATATTGATAGAGCTnTGATATCATrGAGCGCrTGAAG 

CTGGTGCCCCTGAATCAGGAAAGTGTGGAAGAGAGAGTGGCTGCnTCAGAAATTTCAGT 

GATGAAATCAGGCACAACCTCTCAGAAGTGCTTCTTGCCACCATGAACATCTrGTTCACA 

CAGTTTAAGAGGCTCAAGGGGACAAGTCCATCCTCGTCATCCAGGCCCCAGCGAGTCATC 

GAGGACCGCGACrcrCAACrCCGAAGTCAAGCCCGCACTCTGATTACCrTTGCTGGAATG 

ATACCATACCGAAC GTCT GGGGACACCAATGCGAGGCTGGTGCAGATGGAGGTCCTCATG 

AArrAAGTGCC\TGCTTTGTGGGAGTCTGGGTCGGCACACTGTCAGTACATCAGGCACAT 

GGGCCCACTAGGCTGGGGTTTCTGG l 1 1 lUl l rC TGTTGT Ol 1 1 lUl ri l GGnTCTGTA 

TTATGTA 1 1 1 1 i GTCAACGCCAATAAA 1 11 CriT GAITTGT 
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S^^^;^SSQP5I.VDLCASVAELDDKSISD^«™VKQK^>VU.TPA^1> 
RDSQLRSQARTLrlTAOMIPTOTSGDTOARLVQMEVLMN- 
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CTAGTTTCTAAGGATCATGTCTGCGAGCCAGGATTCCCGATCCAGAGACAATGGCCCrGA 

TGGGATGGAGCCCGAAGGCGTCATCGAGAGTAACTGGAATGAGATTGTTGACAGCTTTGA 

TGACATGAACCTCTCGGAGTCCXTrCTCCGTGGCATCTACGCCTATGGTnTGAGAAGCC 

CTCTGCCATCCAGCAGCGAGCCATTCTACCTTGTATCAAGGGTTATGATGTGATrGCrC^ 

AGCCCAATCrGGGACTGGGAAAACGGCCACATITGCCATATCGATTCTGCAGCAGATTGA 

ATTAGATCTAAAAGCCACCCAGGCCTTGCTCCTAGCACCCACrCGAGAATrGGCTCAGCA 

GATACAGAAGGTGGTCATGGCACTAGGAGACTACATGGGCGCCTCCTGTCACGCCTGTAT 

CGGGGGCACCAACGTGCGTGCTGAGGTGCAGAAACTGCAGATGGAAGCTCCCCACATCAT 

CGTGGGTACCCCTGGCCGTGTGnTGATATGCTTAACCGGAGATACCTGTCCCCCAAATA 

CATCAAGATGTTTGTACTGGATGAAGCrGACGAAATGTTAAGCCGTGGATrCAAGGACCA 

GATCTATGACATATTCCAAAAGCrCAACAGCAACACCCAGGTAGTlTTGCTGTCAGCCAC 

AATGCCTTCTGATGTGCrrGAGGTGACCAAGAAGTrCATGAGGGACCCCATTCGGATTCT 

TGTCAAGAAGGAAGAGTTGACCCTGGAGGGTATCCGCCAGTTCTACATCAACGTGGAACG 

AGAGGAGTGGAAGCTGGACACACTATGTGACrrGTATGAAACCCTGACCATCACCCAGGC 

AGTCATCTTCATCAACACCCGGAGGAAGGTGGACTGGCrCACCGAGAAGATGCATGCTCG 

AGATTTCACTGTATCCGCCATGCATGGAGATATGGACCAAAAGGAACGAGACGTGATTAT 

GAGGGAGTTTCGTTCTGGCTCTAGCAGAGTTTTGATTACCACTGACCTGCTGGCCAGAGG 

CATTGATGTGCAGCAGGl 1 ICl 1 lAGTCATCAACTATGACCTTCCCACCAACAGGGAAAA 

CTATATCCACAGAATCGGTCGAGGTGGACGGnTGGCCGTAAAGGTGTGCCTATTAACAT 

GGTGACAGAAGAAGACAAGAGGACTCTTCGAGACATTGAGACCTTCTACAACACCrCCAT 

TGAGGAAATGCCCCTCAATGTrGCrGACCTCATCTGAGGGGCTGTCCTGCCACCCAGCCC 

CAGCCAGGGCTCAATCTCTGGGGGCTGAGGAGCAGCAGGAGGGGGGAGGGAAGGGAGCCA 

AGGG ATGGACATCTTGTCA 11 1 1 11 1 1 L 1 1 1 G AATAAATGTCAC 1 1 1 1 1 GAGGCAAAAGA 

AGG 
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cDNA 

CCAGCACTGGCTGCAC CATGCA CCTGAACGGCAAAGTGGCGCrGCTGACCGCCGCGGCTC 

AGGGCATAGGCAGAGCCTTTGCAGAGGCGCrGCTCCTTAAGGGCGCCAAGGTAGCGCTGG 

TGGATTGGAATCrrGAAGCAGGTGTACAGTGTAAAGCTGCCCTGCATGAGCAATrrGAAC 

CTCAGAAGACTCTGTrCAT CCAGTG CGATCTGGCrGACCAGCAACAACrGAGAGACACIT 

TTAGAAAAGTTGTAGACCAC I'l lUGAAGACTGGACATnTGGTCAATAATGCTGGAGTGA 

ATAATGAGAAAAACTGGGAAAAAACTCTGCAAAITAAl i ICKil l ICTGTTATCAGTGQAA 

CCTATCTTGGTTTGGATTACATGAGTAAGCAAAATGGAGGTGAAGGCGGCATCATrATCA 

ATATGTCATCmAGCAGGACTC^TGCCCGTTGCACAGCAGCCGGT^TA•^■GTGC^^CAA 

AGCATGGCATAGTTGGATrCACACGCTC AGCA GCGTTGGCrGCTAATCITATGAACAGTG 

GTGTGAGACTGAATGCCATITGTCCAGGCmGTTAACACAGCCATCCrTGAATCAATTG 

AAAAAGAAGAA AACA TGGGACAATATATAGAATATAAGGATCATATCAAGGATATGATTA 

AATACTATGGAATTTTGGACCCACCVTTGATrGCCAATGGATrGATAACACTCATrGAAG 

ATGATGCTrrAAATGGTGCTATTATGAAGATCACAACnTCTAAGGGAATrCATITrC^ 

ACTATGATACAACnrCATrTCAAGCAAAAACCCAATGAACAGCTTATGTGTTAGCCATAG 

CTGAAAATAAGCACAAATAGCTTATATrCAGATCCTATCITCATT^ 

AATGAAATGTTACAGTTTGAAGTTTTCCTTCATGCACTTGGTGATAAACGTnrCTAAAT 

mTAGTTAAGTATATGGATAAAAAGTrATGAACrATTAAAAATGTGATGTGGACCAAAG 

GCrAGGTTGTA ATOrT GATAGTCTAAAAAATGATCAAAACAAATGATnTCAAGGAATAT 

TCAATATTCTG CCTTT CAGAAAGTGTATn'ATATCTGTGCTTCATAAATATrAATGTrCT 

TCAGAACATCArmAAAGGAGATACITGAATTGTTATITAAATCAAACCAGATGTAAAA 

CACTCACATACAAGTTCATACTnAAAAGAGGAAAGCrACTTAACAATGACAAATATrrC 



ACAATAATAAlllllAC rrAT ATACCATCTrrCAACTGAACATTTCAGTrCrrCCAAGAG 

CllL'llAGAGTAGTATATnTGGGGGCAGTCAAGGAATAAACTACAGTGTAAACATATCC 

CAGATGAAAACTGCTGTATGGAAAAATGAOVGAAAGTAACTGATTGACACTGTrGATrCA 

CAGTrCAGCCTCCTATCTGGGAAAGAC Al 1 iC i llt'CTCTGCTCACTTrAAGAACTnTA 

CCGACTCCAAAAATCTCAGGAATrAAACnTTAACAGTTACAGCAATAAAGAATAGTTAG 

TACTCCAAAAAT ATTAT ATITAAGATGCrCAACAAGAAAAAAATGCAAATGTAATATnT 

TTrCAAATrACTrCTTTATTGACTTGTCCAAAmc^^ 

CTTTmATrCTGATCrCCATAAATTACTTAGTCTTCTATGTATAGCT^ 

AAACCAATTITGCCACAGCCACAACTGTAAATGTTmGTACCCATGCrGAAACTCATAA 

CAACACAGACATAAAA ATAGC TGTGAGGnTTG Cl U U 1 lUri GTCAGCrATCTTAAGA 

ATCATTAAATACACCTGCirrGGGTAAAACTCTTTGCAAGCAGTAATTAACACrAGTAAC 

AGTGAAAGCACAAGATITCCAAATCAGTCGTITrCTCAAAAAAATATCGTATAAGTGACT 

CATCCTGTCTGCTAACTCCAGACCTCCCAGCTrGAAGCaVAATCTTTCCATGTGAGATTG 

ATATGGATITCCTAGAAGT ACTGG AATGTrGTCATATCTTGCCCTATTTrAATTCTGCTA 

TAGA AAACAATTGCCTTCACrmAAGGAGTAATITGAATATrAATAACTCTGGTCTAGA 

rrrTCATATAATGTATTAAAG ACAAA GTAGTGAACATCAATGAACATCTGATAGAOATAA 

ACrGTAATCAGGCATAAGCrTGTTTGTATGTTCTGGCAGTGACTAATCAGTAAATGATGT 

CGGTTTGCCCAGTATCACrrATCTTCTGTA'nTAATACATTATATGAAAATCTAGACCAG 

AGTTATTAATATTCAAATrACTCCTTAAAAGTGAAGGCAATTGTmCTATAGCAGAATT 

AAAATAGGCAAG 



AMINO ACID y 

MHVNGKVALVTGAAQGIGIUFAEAlIiKGAKVALVDWm,EAGVQCKAAUIEQFEPQKT^ 

IQa)VAIXJQQLRDTFRKVVDHFGRU)ILVWAGVNNEK>nATEICIXQ^ 

YMSKQNGGEGGmhnvISSl^GIAIPVAQQPVYCASKHGIVGFniSAAlJkANI^ 

ICPGFVKTAILESIEKEENMGQYIEYKDHIKDMIKYYGIIJJPPUANGUTUEDDA^ 

AIMKnrSKGIHFQDYDTrPFQAKTQ* 
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